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The Correlation of Local Citation 
Data with Citation Data from 
Journal Citation Reports 

Janice Kreider 

University librarians continue to face the difficult task of determining which 
journals remain crucial for their collections during these times of static finan- 
cial resources and escalatingjoumal costs. One evaluative tool, Journal Cita- 
tion Reports (JCR), recently has become available on CD-ROM, making it 
simpler for librarians to use its citation data as input for ranking journals. 
But many librarians remain uncomnnced that tfie global citation data from 
the JCR bears enough correspondence to their local situation to be useful. In 
this project, 1 explore the correlation between global citation data available 
from JCR with local citation data generated specif ically for the University of 
British Columbia, for 20 subject fields in the sciences andsocial sciences. The 
significant correlations obtained in this study suggest that large researclv- 
oriented university libraries could consider substituting global citation data 
for local citation data when evaluating their journals, with certain cautions. 



University librarians continue to 
search for data that helps them evaluate 
their collections, particularly their jour- 
nal collections, as subscription costs rise 
approximately 10% each year and as addi- 
tional funds are needed to pay for access 
to full-text electronic journals. Use stud- 
ies are time-consuming if one wants to ob- 
tain enough data to make the study mean- 
ingful, and the compilation of the results 
can be cumbersome. With the recent ap- 
pearance of Journal Citation Reports 
(JCR) on CD-ROM, quantitative citation 
data are now relatively simple to manipu- 
late. Because the citation data on JCR are 



global, librarians have questioned the rele- 
vance of JCR's data to their own institu- 
tions, preferring citation data generated 
from their own users' publications. These 
local data are more difficult to obtain, par- 
ticularly if one wishes to restrict the data to 
one subject area, which is crucial because 
citation patterns vary by subject field and 
affect citation figures. In order to deter- 
mine how global citation data relates to lo- 
cal citation data, I explored the correlation 
between global citation data from the JCR 
and local citation data for the University of 
British Columbia (UBC) for 20 subject 
fields in the sciences and social sciences. 



Janice KREIDER (jkreider@interchange.ubc.ca) is Coordinator of Collections at the University of 
British Columbia Library, Vancouver. The author wishes to acknowledge the use of the British Li- 
brary Information Sciences Service and the Oxford University Libraries, William Ng of the UBC 
Library Systems Division who provided lists of UBC subscriptions by subject, and Kathy Li and 
Kelly Kwork who were students in STAT 551. Manuscript received June 11, 1998; accepted for 
publication August 18, 1998. 
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Literature Review 

Previous studies on the relationship of 
global and local citation data have been 
few in number and limited to only a few 
subject fields. Wiberley (1982) studied 
the relationship of local and national cita- 
tion data for social work by using an ency- 
clopedia and four periodicals as the 
sources for the national citations and local 
citations from publications from lists of 
faculty publications. He compared the 
method of using earlier national citations 
to predict later local citations with the 
method of using earlier local citations to 
predict later ones and concluded (358) 
that for social work, "national databases of 
citations are almost as good as local data in 
predicting future citation of journals by 
local authors." This conclusion therefore 
suggests that national citation data are 
useful for journal selection. Wiberley's 
study took place in a university known for 
its high productivity in publishing in the 
field of social work, but because the field 
was one of the applied social sciences, the 
field exhibits different citation patterns 
than does a subject in the pure sciences. 

Joswick and Stierman (1997) showed 
that there was little relationship between 
global citation data from JCR and local ci- 
tation data gathered from the three cita- 
tion databases from the Institute for Sci- 
entific Information (ISI) and organized 
using DIALOG'S "rank" command. How- 
ever, all subjects were considered to- 
gether. The university's comparatively 
smaD departments in chemistry and phys- 
ics as well as the lack of a medical program 
meant that there was little correspon- 
dence between the predominance of so- 
cial science titles cited by the professors 
and JCR's top cited titles, which lean 
heavily toward scientific titles. 

Garfield (1972) originally suggested 
using global citation data to evaluate jour- 
nals. Although this practice, along with 
other criteria for evaluating journals, has 
since become fairly common in academic 
libraries, there is continuing discussion 
and debate on the relationship between 
citation data and other indicators of the 
use of journals in libraries. Since 1972, 
there have been numerous studies and 



reviews of the literature. Broadus (1985, 
33) summarized previous studies and 
stated that "counts based on the JCR can 
be almost as good as expensive local [use] 
studies forpredicting use of periodicals in 
a given library." But Broadus cautions 
against automatically eliminating a jour- 
nal that has low global citation counts in 
the library, as there can be valid local rea- 
sons for retaining the title. 

Bensman (1985) summarized re- 
search on the correspondence between 
global citation data and use, and despite 
some conflicting studies, states (24) that 
"ISI citation frequency is measuring an 
extremely powerful variable in academic 
library use, and ... is undoubtedly one 
of the most important measures that can 
be utilized . . . [for] managing the jour- 
nal collections of research libraries." 
Bensman was not concerned that the cita- 
tion data were not local. 

Todorov and Glanzel (1988) reviewed 
studies on the relationship between jour- 
nal citation measures and objective and 
subjective ratings of scientific journals; in 
some of the studies there was a positive 
relationship, while in others there was 
not. Kelland and Young (1994) presented 
a comprehensive review of the literature 
on the relationship between citation data 
and library use. Because the correlation 
data vary, the authors conclude (86) that 
"citations represent a form of literature 
use, and to some extent that can be con- 
sidered library use. . . . Actually, the rela- 
tionship between library use and citations 
is so complex that it should not be ex- 
pected to produce high correlations." 

Swigger and Wilkes (1991) compared 
techniques (including using local citation 
data generated by a DIALOG search of 
ISI) when evaluating journals at Texas 
Woman's University and concluded (52): 
"There is only a weak correspondence be- 
tween use of serials as measured by 
reshelving data and by citation data, and 
no correspondence between citation data 
and the subjective judgments of faculty or 
librarians." The Texas Woman's Univer- 
sity has graduate programs in allied 
health, education, and library science, so 
the results are limited to those fields. On 
the other hand, Bensman (1996) showed 
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a strong correlation in the field of chemis- 
try at Louisiana State University between 
global citation data (in the form of total ci- 
tations from JCR for the journals) and rat- 
ings of those journals by faculty members. 

To summarize, although there is not 
total agreement that the relationship of 
citation data with other indicators of use 
in libraries is strong, it generally is ac- 
knowledged that such a relationship exists 
and that it is worthwhile to take citation 
data into account with other measures 
when evaluating journal titles. 

The question remains whether the ci- 
tation data must be local, or whether 
global data are just as useful. Despite the 
difficulties inherent in generating local 
citation data, authors of articles published 
in the last several years reveal that librari- 
ans are using local citation data in prefer- 
ence to, or in addition to, global citation 
data due to a widespread reluctance to 
rely on global citation data to evaluate 
journals. Haas and Kisling (1994) re- 
ported on a project at the University of 
Florida to evaluate the relevance of their 
science journal collection using local cita- 
tion data produced from ISI. Schmidt, 
Davis, and Jahr (1994) used local citation 
data, faculty ranking, circulation statis- 
tics, and impact factor to prioritize biol- 
ogy journals at the University of Illinois. 
Sylvia and Lesher (1995) used local 
citation data generated from psychology 
theses and dissertations in a cost ratio to 
evaluate their psychology journal collec- 
tion at St. Mary's University. Hughes 
(1995) examined local citation and publi- 
cation data along with global data when 
ranking journals in molecular and cellular 
biology at Pennsylvania State University. 
Dole and Chang (1996) conducted four 
local citation analyses in the fields of 
marine sciences, chemistry, and sciences 
as a whole to produce local citation data to 
use along with faculty rankings and use 
studies in preparation for a journal 
cancellation project at the State Univer- 
sity of New York at Stony Brook. 
Loughner (1996) produced local citation 
data from ISI to evaluate use of science 
journals at the University of Georgia. 
Lambert and Taylor (1996), who evalu- 
ated journals at Staffordshire University 



in the U.K., stated (318): "Citation rank- 
ings seemed too general; we would not 
have felt justified in canceling subscrip- 
tions purely on the basis of externally gen- 
erated lists." 

Statement of the Problem 
and Purpose of the Study 

Citation data are commonly used along 
with other criteria such as reshelving 
statistics and ratings by faculty and librar- 
ians to determine core journal titles in a 
field or to find candidates for cancella- 
tion. Librarians often rely on local cita- 
tion data rather than global citation data, 
despite the ease of gathering global cita- 
tion data and the ability to manipulate 
them electronically, using the CD-ROM 
version of JCR. Librarians nevertheless 
remain unconvinced of the value of global 
citation data, thanks partly to the lack of 
studies. The purpose of this study is to ex- 
plore the correlation between global cita- 
tion data and local citation data to deter- 
mine the potential of relying on global 
citation data instead of local citation data, 
for evaluating a journal collection. A sec- 
ond purpose is to design and test a consis- 
tent method for obtaining correlations 
across a number of subject fields at one 
specific institution. 

Sources of Data 

The local institution involved in this study 
was the University of British Columbia 
(UBC), a pro vinci ally funded university 
with the third largest academic library in 
Canada, a student body of 27,000 under- 
graduates and6,QG0 graduates, and facul- 
ties not only in arts and sciences, but also 
medicine, dentistry, pharmacy, law, engi- 
neering, commerce, education, agricul- 
ture, forestry, etc. Its overall rank among 
Association of Research Libraries (ARL) 
has ranged between 25 and 30 during the 
past 10 years. 

The local citation data were provided 
by the Local Journal Utilization Report 
(LJUR), produced by ISI from its data- 
base specifically for UBC. This report, in a 
database format, contains counts for the 
number of times authors from UBC cited 
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specific journals during a 12.5-year period 
from January 1981 to June 1993. The 
LJUR was generated using all author ad- 
dresses, not solely the first author. 

The 1994 edition of JCR on CD-ROM 
produced by ISI was the source of the 
global citation data for this study. The spe- 
cific global citation data used were the 
"1994 Total Cites," which is the number 
of times a given journal title was noted as 
being cited in ISI's Science Citation Index 
and Social Sciences Citation Index during 
1994. 

The time periods of the two sets of data 
do not overlap; the sources were chosen 
because they were available in electronic- 
format, which facilitated analysis using 
spreadsheets and databases. ISI produced 
the LJUR for UBC in 1993, but the JCR 
was not available on CD-ROM until the 
1994 edition was produced in 1995. 

Because this project was based on ISI 
data, it has all the limitations of that data- 
base: 

• errors in the data themselves 

• the limited number of journals cov- 
ered (an example is the field of law, for 
which only 96 titles are listed in JCR, 
which led to the exclusion of the sub- 
ject of" law in this study) 

• JCR is based only on journals; hence 
citations to journal articles from other 
types of publications are not included, 
and for subject areas that are less jour- 
nal-centric, these might be significant 

• few foreign language titles are included 

• total citations are influenced by the 
length of time the title has been pub- 
lished; titles begun during or near the 
end of the 12.5-year period covered by 
the LJUR in this study will not have ei- 
ther global or local citation data com- 
parable to those from well-established 
titles 

Method 

In order to identify journal titles for a spe- 
cific subject field, two sources were used. 
The first was the JCR, which assigns each 
journal title to one or more subject fields. 
The second was UBC s existing local cod- 
ing of all active subscriptions to one or 
more of the 20 subjects under consider- 



ation, corresponding to departments at 
the university. The subjects assigned by 
the JCR are finer categories than those as- 
signed by UBC. In order to have them 
correspond as much as possible, the JCR 
was filtered to obtain similar subject 
groupings. For example, to correspond 
with the subject of "Chemistry" as de- 
fined by UBC, the following subject cate- 
gories were chosen from the JCR: Chem- 
istry; Chemistry, analytical; Chemistry, 
inorganic and nuclear; Chemistry, or- 
ganic; Chemistry, physical; Electrochem- 
istry; and Spectroscopy. 

The following databases were loaded 
into a local database on a personal com- 
puter: the LJUR, the list of titles for each 
of the 20 subjects from the JCR, and the 
JCR lists of all science titles and social sci- 
ence titles. Using the UBC subject list- 
ings of subscriptions, subject codes were 
keyed into the LJUR. If a UBC subscrip- 
tion was not in the LJUR, the title was 
added, with zero as the number of cita- 
tions. The database was used to create a 
list for each subject, with the following in- 
formation for each journal: (1) its title as 
abbreviated by ISI; (2) the global citation 
count (i.e., the 1994 total citations from 
JCR); and (3) the local citation count (i.e., 
the number of times UBC authors cited 
the title over the 12.5 year period from 
the LJUR). The beginning of one of these 
lists is shown in table 1 for the field of 
zoology. 

The Pearson correlation coefficient 
was chosen to measure the correlation 
between the global citation data and the 
local citation data. The Pearson correla- 
tion coefficient reflects the extent of a lin- 
ear relationship between two sets of data; 
it ranges between -1 (indicating a perfect 
negative relationship) and +1 (indicating a 
perfect positive correlation). A correlation 
close to ±1 indicates a strong correlation. 
Other correlation studies have used the 
Spearman correlation coefficient, which 
measures the correlation of ranked data. 
However, if the raw data are available (not 
just the rankings), it is preferable to use 
the Pearson correlation because the actual 
data give more information, such as the 
varying size of gaps between the ranked 
data. Before applying the Pearson correla- 
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TABLE 1 

Zoology Journal Titles with Global (from ]CR) 
and Local (from LJUR) Citation Counts 



Abbreviated Title 


Global 


Local 


ACAROLOGIA 


1 A 





ACTA AN AT 


1,0 IL 


oo 


ACTA BIOL CRACOV ZOO 


3 


A 
\l 


ACTA PHYTOPATHOL HUN 


97 


A 


ACTA PROTOZOOL 


195 


A 
V) 


ACTA THERIOL 


360 


lo 


ACTA ZOOL HUNG 


33 


A 


ACTA ZOOL-STOCKHOLM 


oy / 


O 


ADV INSECT PHYSIOL 


4 ZD 


04 


ADV PARAS IT 


481 


12 


ADV STUD BEHAV 


3 3d 




AFR J ECOL 


lo9 


tr 


AM BEE J 


155 





AM ENTOMOLOGIST 





4 


AM ] PHYS ANTHROPOL 


Z,04Z 




AM J PHYSIOL 


78,546 


2,634 


AM J PRIMATOL 


826 





AM MALACOL BULL 


79 


4 


AM ZOOL 


2,652 


202 


AMPHIPACIFICA 









tion, the citation figures were transformed 
using the logarithmic transformation. The 
decision to transform the data was based 
on an examination of the histograms of the 
data for the field of microbiology. For that 
field, both the global and the local citation 
data sets were dominated by a small num- 
ber of highly cited journals; in statistical 
terminology, the variance increases with 
the mean, resulting in a negative binomial 
distribution. The other subject fields ap- 
peared to have similar distributions. Fig- 
ure 1 illustrates the dominance in the field 
of zoology by one title, in this case the 
American Journal of Physiology, for both 
local and global citations. These observa- 
tions agree with those of Bensman (1996), 
who has worked with similar citation 
counts and who noted that the logarithmic 
transformation is an appropriate way of 
dealing with such data. 



Unfortunately this procedure was not 
quite as uncomplicated as it would appear. 
As noted by Haas and Kisling (1994), 
Loughner (1996), and Harter (1998), raw 
ISI data can be disconcertingly inconsis- 
tent. The LJUR required extensive editing 
in order to combine titles with different 
abbreviations, to combine titles that had 
changed, and to correct errors. When the 
editing was completed, a great many titles 
listed with only one citation remained, and 
many of these were difficult to identify by 
full title, either because they were eso- 
teric, ambiguous, incorrectly abbreviated, 
or referred to monographs. Examples of 
such titles include PHYSIS, PLENARY 
LECTURE, and TXB PSYCHIATRY. In 
order to eliminate having to spend undue 
time identifying these titles of little im- 
portance and in order to make the data- 
base smaller and easier to manipulate, all 
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Figure 1. Scatter Plot of Raw Data for Zoology Titles. 



of the titles having only one citation were 
eliminated except: (1) those in which 
UBC authors had published {using the 
publislung data from the LJUR); and (2) 
titles that began with CAN, CANADA, 
CANADIAN, BC, BRITISH COLUM- 
BIA, VANCOUVER, etc., because we 
wished to retain as much Canadian data as 
was feasible. Although not necessary for 
this correlation study, obvious mono- 
graphs; titles indicated as INPRESS, 
UNPUB, and THESIS; and titles begin- 
ning with numbers or abbreviations of 
months were eliminated. 

Another modification to the LJUR was 
its expansion to include 755 titles in the 20 
subject fields to which UBC actively sub- 
scribes. These titles had a local citation 
count of zero, but 175 had global citation 
data from JCR. The primary reason for 
adding these titles was; to analyze all UBC 
subscriptions in the 20 subject areas for 
correlation between global and local cita- 
tions. The effect for this study was an ex- 
pansion of the data beyond that of ISI as 
the source. The periodicals added in- 
cluded esoteric titles, popular titles such 
as Sky and Telescope, new titles that had 
not existed long enough to have received 
citations at UBC, foreign titles, and titles 
perhaps a bit outside a strict definition of 
journal such as the Yearbook of Astron- 
omy. Modifications to the JCR were also 



needed, because it was already several 
years old and numerous titles had 
changed or split into parts, so data for the 
various versions of a title had to be gath- 
ered under the latest version of the name . 

The LJUR ended up with 10,601 titles, 
with local citations ranging from a high of 
5,350 for the Proceedings of the National 
Academy of Sciences to the 755 titles with 
a citation of zero. The JCR for the science 
fields contained 4,438 titles, with global 
citations ranging from a high of 265,329 
for the Journal of Biological Chemistry to 
a low of one citation for 13 titles. The JCR 
for the social science fields contained 
1,402 titles, with global citations ranging 
from a high of 20,038 for Archives of Gen- 
eral Psychiatry to a low of one citation for 
four titles. The two JCR sets had an over- 
lap of 148 titles. The LJUR and the JCR 
for the sciences overlapped by 3,212 titles 
(72% of/CR), and the LJUR and the JCR 
for social sciences overlapped by 946 
titles (67% of the JCR titles). 

Discussion of Results 

Table 2 fists the 20 subjects that were an- 
alyzed for this study, arranged in de- 
scending order by correlation of the data 
after they had undergone a logarithmic 
transformation, where N is the number 
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TABLE 2 

Correlation of Global and Local Citation Data 



Subject Field 


N 


Feaison 
Correlation 
of Raw Data 


Pearson Correlation 
Transformed Data 


Microbiology 


89 


0.953 


0.814 


Forestry 


140 


0.817 


0.810 


Astronomy 


42 


963 


0.755 


Economics 


260 


0.916 


0.731 


Business and Management 


279 


0.849 


0.695 


Biochemistry 


272 


0.959 


0.687 


Pharmacy 


176 


0.832 


0.682 


Computer Science 


254 


0.696 


0.681 


Mining 


39 


0.617 


0.677 


Biology 


346 


0.968 


0.675 


Botany 


154 


0.783 


0.661 


Mathematics 


307 


0.800 


0.634 


Zoology 


220 


0.927 


0.634 


Physics 


343 


0.903 


0.625 


Mechanical Engineering 


177 


0.816 


0.623 


Librarianship 


135 


0.613 


0.622 


Geology 


208 


0.818 


0.607 


Chemistry 


401 


0.871 


0.567 


Metals and Materials 


212 


0.579 


0.547 


Chemical Engineering 


148 


0.477 


0.530 



N = number of journal titles in each subject 



of journal titles in each subject cohort. 
The correlations between the global and 
local citation counts using the raw data 
were extremely high in most fields, a find- 
ing that was not corroborated by visual in- 
spection of the scatter plots. Only five 
suhjects had a correlation below .7, but as 
mentioned previously, the skewed distri- 
bution of the data (large clusters of both 
global and local citation data dominating 
the low end) indicates that more realistic 
correlations can be obtained from the 
transformed data. This led to the ratio- 
nale for arranging the table in descending 
order by correlation of the data after a log- 
arithmic transformation was performed. 
The actual ranking of the subjects is less 
important than the fact that the correla- 
tion figures are moderate to moderately 
high for all the subjects. The scatter plots 
of the transformed data show a linear rela- 
tionship of varying strength for each sub- 



ject, a relationship that gets decidedly 
weaker for the smaller citation values. 
Figure 2 shows the relationship for zool- 
ogy. Note the cone shape of the data 
points that gets wider for the smaller 
values, indicating the increasing weak- 
ness of the relationship as the citation val- 
ues decrease. The fact that the linear rela- 
tionship between the global and local 
citation transformed data gets weaker for 
the smaller values has implications for 
libraries. It means that a low global cita- 
tion count does not always imply a corre- 
spondingly low local citation count, de- 
spite the moderate to relatively high 
correlation figures for all the subjects. 

The placement of the various subjects 
in the ranked list in table 2 does not per- 
mit any general conclusions about the 
strength of a subject's correlation based 
on whether the subject is a pure or an ap- 
plied science, a science or a social sci- 
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Figure 2. Scatter Plot of Transformed Data for Zoology Titles 



tmce, or the degree of a subject's depend- 
ence on journal publishing. Neither does 
there seem to be a relationship between 
the number of titles included in a subject 

An interesting phenomenon is that 
local citation data are zero for numerous 
journals in all 20 subject fields. There 
seem to be three reasons for this. First of 
all, 755 titles were imported into the sub- 
ject groupings because they were identi- 
fied with tbe subject by UBC s local coding 
of all active subscriptions but were not in- 
cluded in the original LJUR due to the 
title's obscurity (e.g., Amphipocificd) or 
because the title was a popular rather than 
a research title (e.g., Audubon). A second 
reason is that most of the titles from the 
LJUR that had only a single citation were 
culled. Finally, some of these titles are sim- 
ply of little relevance to research at UBC 
(e.g., American Bee Journal). 

It might be assumed that researchers 
at UBC would cite Canadian journals 
more heavily than would be the case glob- 
ally, which would produce relatively 
lower correlations in this study than 
would be obtained for an institution in the 
U.S. However, each subject field includes 
only a few Canadian titles, and often only 
one is truly important, so the effect is 
minimal. The subject list for business and 
management includes 25 Canadian titles, 
more than the other disciplines. Exclud- 
ing the 25 Canadian titles and then recal- 
culating resulted in a correlation of .702, 
compared with the correlation of .695 
when the Canadian titles were included. 
However, the opposite effect occurred in 
forestry, which included 14 Canadian 



titles; recalculating the correlation with- 
out the Canadian titles gave a correlation 
of . 794 instead of the .810 when the Cana- 
dian titles were included. 

The JCR data (1994) and the LJUR 
data (1981-93) obviously do not come 
from the same time periods. At the time 
of this study, there was only one year of 
JCR data available on CD-ROM, and it 
might have seemed more worthwhile to 
have used only one year of local data (and 
from the same year) to correspond pre- 
cisely with the one year of global data. 
However, for a given year, local citation 
data are considerably sparser than global 
citation data. Just as use studies based on 
reshelving counts require a long time 
frame to be representative, a citation 
study using merely one year of local cita- 
tions would not be as indicative of local ci- 
tation practices as more years would be, 
both in terms of the variety of titles cov- 
ered and citation counts themselves, 
making a reliable correlation figure diffi- 
cult to obtain. Whether a full 12.5 years of 
local citation data was necessary is open to 
debate. Certainly with only one year of lo- 
cal data, the correlations would have been 
much lower. Although JCR data change 
from year to year (probably more so for ti- 
des with few citations), they are still rela- 
tively stable— likely more stable due to 
sheer size — than are local citation data. 
Line (1985) found a 92% overlap for the 
top 500 journals cited the most fre- 
quently in ISI's Science Citation Index 
(SCI) in 1979 and in 1982. For the Social 
Science Citation Index (SSCI), there was 
an 87% overlap for the same time period. 
Hence, using JCR data (which are drawn 
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from SCI and SSCI) from a different year 
than that covered by LJUR should have a 
relatively minor impact on the results. 
The local citation data on LJUR would 
vary more from year to year than do global 
data. ISI has since enhanced the product 
to indicate the years of the citations, mak- 
ing it possible to study the growth or de- 
cline in citations for a particular title. 

The nature of this project required 
that it be specific to our institution, and 
factors unique to UBC might have af- 
fected the data in unknown ways. There- 
fore, the results should not be over- 
generalized for use by other institutions 
but rather should be viewed as providing 
an exploratory study of correlations for 
various subject fields using a specific 
method. The relative correlation for each 
subject is of less importance than the fact 
that a moderate to moderately strong cor- 
relation was shown consistently for all 20 
subjects. This allows others to consider 
the similarity of their situation to UBC 
and to decide whether the size of the re- 
sulting correlations merits their substitut- 
ing global citation data for local citation 
data when determining the importance of 
a journal to their library collection. 

Because both the JCR data and the 
LJUR data come from ISI, it could be 
argued that the result has a high inherent 
correlation because ISI's database is 
based only on journals and the set of jour- 
nals is limited. A better method might be 
that of obtaining the local citation data 
directly from local publications of various 
types, as has been done by some librarians 
when evaluating journals, such as Sylvia 
and Lesher (1995), whose source of local 
citations was graduate theses in psychol- 
ogy, and Dole and Chang (1996), who 
obtained local citations not only from a 
search of ISI but also from a list of publi- 
cations by their faculty in marine sciences 
and from doctoral theses in chemistry. 
The local citations produced by the list of 
publications of the marine science faculty 
were dispersed over more titles than the 
ISI study, but the years of coverage did 
not coincide by date or length. Obtaining 
local citation data from sources other than 
ISI could produce more data, particularly 
in fields where the nature of publishing 



includes a substantial proportion of non- 
journal formats, such as monographs, 
conference proceedings, preprints, and 
technical reports. Obtaining local citation 
data from sources other than ISI, how- 
ever, has the disadvantage of being ex- 
tremely time-consuming to compile and 
organize; consequently past studies have 
been limited to few subject fields. It is dif- 
ficult to evaluate studies of one subject 
area in isolation, and it is next to impossi- 
ble to compare studies in which wildly dif- 
ferent methods were used. The obvious 
advantage of the method presented here 
is that it can be done on a personal com- 
puter, and allows numerous subjects to be 
examined and results compared. 

Implications for Libraries 

It should be noted that the usefulness of 
both global and local citation data is lim- 
ited to evaluating titles lor research pur- 
poses; it has little validity for evaluating 
titles intended primarily for student use, 
particularly undergraduates. Joswick and 
Stierman (1997) showed that titles used 
by students do not bear any relationship 
to the number of citations in faculty pub- 
lications. Moorbath (1993) studied the 
nursing literature and concluded (44) 
that there is "a significant difference 
between rank by student use and rank in 
the Citation Index." On the other hand, 
Zipp (1996) found a positive association 
between faculty citations and citations in 
theses and dissertations. It should also be 
remembered that results of this study are 
not applicable to smaller institutions that 
cannot offer as broad a base of ongoing re- 
search and publishing, because the local 
research might be less likely to corre- 
spond with global research and thus prob- 
ably result in far lower correlations be- 
tween local and global citation data. 
Swigger and Wilkes (1991, 44) stated that 
using local citation data is "likely more 
valid, particularly for small to medium 
institutions where research is more lim- 
ited in scope and number of projects than 
at large universities with large faculties." 
Sylvia and Lesher (1995) used local cita- 
tion data but also advocated considering 
shelving counts. 



76/ LRTS • 43(2) • Kreider 



The results of this study do not give us 
information on whether the titles cited in 
a specific subject field were cited by re- 
searchers in that particular subject de- 
partment. Because the data from the 
LJUR span the entire university, cita- 
tions to a journal could easily be made by 
someone from a department in a differ- 
ent subject area. This has implications 
for academic department libraries, 
which are interested in limiting the eval- 
uation of journals to their specific sub- 
ject area or perhaps primarily are inter- 
ested in the research scope of their 
specific department. The growing inter- 
disciplinary use of journals and their es- 
calating costs mean that most institutions 
have to take an institution-wide view to 
their library collections, so for them the 
department of the researcher is increas- 
ingly irrelevant. 

Conclusion 

University librarians are always searching 
for ways to evaluate journals, due to the in- 
creasing costs of journals, the creation of 
new journals, and the limitation of funds. 
Data often considered include circulation 
and reshelving studies, ratings and rank- 
ings by f aculty members, local and global 
citation counts, impact factors, availabil- 
ity of indexing, reputation of the pub- 
lisher, cost, language, interlibrary loan 
requests, avail ability from document sup- 
pliers, etc. All this information has value 
when considered along with subjective 
evaluations by faculty and librarians. Re- 
cently, librarians such as Schmidt, Davis, 
and Jahr (1994), Hughes (1995), Sittig 
and Kaalaas-Sittig (1995), and Loughner 
(1996) have proposed various tools or in- 
struments for evaluating journals that 
combine two or more of the following fac- 
tors: a ranking or rating by faculty mem- 
bers, circulation statistics, local or global 
citation counts, local publication counts, 
costs, and impact factors. 

In this present study, it was shown 
that a relatively high correlation exists 
between global citation data ol one year 
and longitudinal local citation data for a 
preceding 12.5-year period for a large re- 
search-oriented university for 20 subject 



fields, suggesting that large research- 
oriented university libraries could con- 
sider substituting the more easily- 
obtainable global citation data (from 
JCR) for local citation data (whether 
from ISI's LJUR or a search of their cita- 
tion indexes or from an analysis of local 
publications) when evaluating their jour- 
nals. High global citation counts have 
been shown to correlate with local cita- 
tion counts. However, because the rela- 
tionship between the global and local ci- 
tation data gets weaker for the lower 
values of the data, librarians should exer- 
cise caution when evaluating titles with a 
low number of global citations and seri- 
ously take other factors into account. Fac- 
ulty members are likely to be more inter- 
ested in local data, so if local citation data 
are readily available such as ISI's LJUR, 
they can serve a double purpose as a tool 
of interest to the faculty along with being 
of use to the librarian, who, because of the 
correlation of global and local data, has 
little need to consider global citation data 
in addition. 
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Pebbles for the Mosaic of 
Cataloging Expertise: What Do 
Problems in Expert Systems for 
Cataloging Reveal about 
Cataloging Expertise? 

Alenka Sauperl and Jerry D. Saye 

Twenty-five years of research in expert systems for descriptive cataloging 
and related areas are reviewed. Researchers who developed prototype expert 
cataloging systems in the 1980s found that cataloging rules are' extremely 
comprehensive and complicated, 'but still insufficient to permit these systems 
accurate cataloging results. Those researchers also identified smaller areas 
that need to be pursued for successful implementation of expert cataloging 
systems. Subsequent research has focused on narrower areas. Cataloging 
rules were studied and more was learned about problems with their logic 
structure and organization. Anotlwr focus was optical reading of biblio- 
graphic elements in documents. Categories of responsibility and graphic de- 
sign continue to pose problems. The visual characteristics of documents were 
studiedia understand more about thnmh-mrtik- a-comntion oftrihlU >gnipUu: 
elements necessary for the bibliographic description of documents. The sys- 
tematic study of the cataloging process, necessary for development of expert 
systems, may result in improvement of manual working procedures and en- 
rich the education of new catalogers. 



Cataloging and Expert Systems 

Cataloging is one of the most time- 
consuming tasks performed in libraries. It 
is not surprising then that, since the intro- 
duction of computers into library opera- 
tions, there have been numerous at- 
tempts to automate the cataloging 
process. The early use of computers to 
create catalog records that were used 



most often to produce cards for card cata- 
logs has been largely supplanted by the 
creation of bibliographic records that are 
used to populate online catalogs. The 
most challenging aspect of cataloging is to 
determine the content of the biblio- 
graphic record rather than to create the 
record itself. This is where research in ex- 
pert systems and related areas plays an 
important role. 
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In expert systems computers are used 
to organize and preserve the expert 
knowledge of catalogers and to employ 
that knowledge in cataloging with mini- 
mal or no human intervention. Unfortu- 
nately, few of these prototype cataloging 
expert systems have succeeded in the real 
world. They have contributed, however, 
to an understanding of cataloging pro- 
cesses and standards, as well as to an un- 
derstanding of the documents cataloged. 
When researchers in this area deter- 
mined that the entire cataloging process 
was too complex to allow for full expert 
system implementation, they then con- 
centrated on some elements of the cata- 
loging process, such as cataloging rules, 
document characteristics, and the charac- 
teristics of names used as access points. 
These efforts resulted in a better under- 
standing of both the advantages and limi- 
tations that computers offer to catalogers. 
Read from another perspective, however, 
the problems found in having computers 
create cataloging might be indicative of 
the problems catalogers themselves face. 
This might be particularly true for novice 
catalogers and students of cataloging. 

Cataloging standards are an essential 
part of the knowledge base for the cata- 
loging process. They state the purpose, 
procedure, and results of the process. The 
most important standards used in catalog- 
ing in the United States are the An- 
glo-American Cataloguing Rules, 2d ed. 
(AACR2) and the documentation that 
supports the MAchine-Readable Cata- 
loging format (MARC). These are supple- 
mented by national and local written poli- 
cies and guidelines as well as by the 
unwritten practices and policies of librar- 
ies, library systems, and bibliographic 
utilities. Catalogers must learn and mas- 
ter all of these in order to perform their 
jobs successfully. 

The purpose of a catalog is to create a 
surrogate of a document to support access 
to that document. A variety of document 
formats are represented in library cata- 
logs. Document characteristics, in combi- 
nation with expected retrieval needs, 
should determine what is contained in the 
catalog record. Catalogers learn to iden- 
tify those characteristics, interpret them 



in accord with the existing rules, and rep- 
resent them in meaningful, standardized 
descriptions. 

The strength of developments in ex- 
pert systems relative to cataloging is the 
process of knowledge acquisition. In this 
process, specific cataloging knowledge, 
decisions, and work procedures are sys- 
tematically studied. The questions moti- 
vating the research described in this pa- 
per were: What have we learned in the 25 
years of research into expert cataloging 
systems about the characteristics of the 
cataloging process, the materials cata- 
loged in that process, and records created 
in the process? If expert systems do not 
provide the answer to cataloging prob- 
lems now, can a better understanding of 
what cataloging is, characteristics of its 
processes, objects (materials), and results 
(records) help to improve the quality of 
work and the training process? 

Defining Expert Systems 

"Expert systems are computer-based 
systems that use knowledge and reason- 
ing techniques to solve problems that 
would normally require human expertise" 
(Morris 1992, 1). The basic elements of 
expert systems include a knowledge base, 
an inference engine, an interface, and a 
general database. Each is described below. 

The knowledge base is where human 
expertise is organized and stored. This 
base contains facts and heuristics of the 
domain of expertise. In the case of auto- 
mated cataloging, the knowledge base 
would contain the current cataloging 
rules and modifications for local practice 
as well as the experience of catalogers. 

The inference engine enables the for- 
malized decision-making process. It 
makes use of the knowledge base to en- 
able the system to perform like an expert. 
The engine infers results based on the 
stored knowledge. 

The interface supports communica- 
tion between humans and the system. 
Usually, the interface has three compo- 
nents . First, a user interf ace enables com- 
munication with the user (the cataloger 
who uses the system). Second, a devel- 
oper's interface assists the knowledge 
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engineer (the person who develops the 
expert system) and makes the develop- 
ment of the system possible. Third, an ex- 
ternal interface provides for data ex- 
change with external sources (e.g., 
authority files or classification schedules). 

The general database keeps track of 
the current problem and records all the 
relevant data and steps in the process. 

Expert systems have shown a great 
deal of potential in the cataloging arena 
for several reasons: 

• Human catalogers can be reassigned 
from routine tasks to work on more ex- 
citing and creative tasks. Thus, cata- 
logers could leave simpler items to the 
expert system and work primarily with 
documents that are more difficult to 
catalog. Similarly, greater resources 
might be dedicated to other catalog- 
ing tasks such as enhancement of sub- 
ject access. 

• Expertise, which is rare and difficult 
to acquire, could be archived and 
saved for the future. When experi- 
enced catalogers leave a library, ex- 
pert systems could be used as consul- 
tants or even as sources of training for 
less-experienced colleagues. 

• Expertise could be distributed more 
widely and used more readily. In con- 
trast to human experts, expert systems 
could be used 24 hours a day by multi- 
ple users at different sites. 

• Critical examination of the decision 
process could be enhanced. The anal- 
ysis of cataloging tasks necessary for 
the development of the system would 
force developers to 
describe the ' 

/errors and il- 
logical steps in the workflow that were 
not obvious before. 

• Full understanding of all parts of the 
process could help in the develop- 
ment of standardized approaches to 
problem solving for particular tasks. 
These standardized procedures might 
result in more consistent catalogs as 
well as support the training of novices. 
With all their positive equalities, ex- 
pert systems would seem to be the obvi- 
ous catalogers tool of the future. How- 
ever, the realization of this ambition has 



been impeded by a number of challeng- 
ing obstacles: 

• The building of an expert system re- 
quires a substantial amount of time 
and work from the builders of the sys- 
tem (the knowledge engineers) and 
from the experts in the particular do- 
main (here, experienced catalogers). 

• The expert system's domain needs to 
be carefully chosen within a narrow 
and well-defined area if the system is 
to be reliable. Despite original hopes, 
researchers have found that descrip- 
tive cataloging is too broad a domain. 
Although cataloging rules exist, they 
have not proven sufficient for accu- 
rate cataloging. Researchers have also 
found that cataloging rules are diffi- 
cult to transfer into a knowledge base. 
In studies, it has been indicated that 
only when cataloging is divided into 
small subdomains will the resulting 
expert system be successful. 

• Knowledge engineers, who are re- 
sponsible for preparing the computer- 
ized version of the human expert, 
need to know the computer system 
well. They also need to be familiar 
with the expert domain in order for 
the required knowledge to be repre- 
sented in the system in a meaningful 
and useful way. 

• Knowledge engineers, who are also re- 
sponsible for acquiring knowledge from 
human experts, frequendy have been 
unable to get those experts to express 
completely what they think and do. 

To explore more fully the potential of 
expert systems for use in cataloging oper- 
ations, we reviewed 25 years of research. 
The studies examined yield important 
information on the fundamental require- 
ments for designing the essential knowl- 
edge base for creating a standard catalog- 
ing record. 

Prototype Expert Systems 

Davies and James (1984) were the first to 
investigate the feasibility of creating an 
expert system for cataloging. They tried 
to develop a system that would provide a 
complete catalog entry (computer record 
or catalog card). Their system required a 
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cataloger to choose options from a menu 
about the type of publication under con- 
sideration and access points. After a series 
of questions about the publication, the 
system offered a template, called a frame, 
for the cataloger to record the remaining 
data for bibliographic description. Davies 
and James found their system very com- 
plex and difficult to manage. Two main 
reasons were identified: (1) AACR2 is 
very complex; and (2) the general data- 
base, which kept track of the current 
problem, needed more space than their 
computer systems was able to provide at 
that time. In essence, the cataloging pro- 
cess was more complex than the existing 
technology available was able to manage. 

Davies (1987) explained further how 
frames could be used in cataloging to in- 
fer and complete particular fields in a 
MARC record. He believed such a system 
should be able to infer who the publisher 
was from the International Standard 
Book Number (ISBN) and provide the 
complete publisher element. This could 
reduce some routine typing and typo- 
graphical errors. Although not specifi- 
cally stated by Davies, it is clear that accu- 
racy — particularly in spelling — is vital. 
Errors in these areas are not trivial for 
they can have substantial impact on later 
retrieval. A good expert system would 
support the cataloging process with edit- 
ing input. 

Cataloging expertise, however, often 
lies in the choice of appropriate rules for 
bibliographic description and the choice 
of access points rather than in the simpler 
inference of publisher or the consistent 
spelling of words. The system created by 
Davies and James was intended to guide 
catalogers through the construction of the 
bibliographic record by providing rules 
relevant to each particular element of the 
description. Ultimately, this was found to 
be too complex for computer systems of 
the early 1980s. 

While Davies and James worked on 
their system, Hjerppe and Olander 
(1989) built the Expert System for Simple 
Choice of Access Points for Entries 
(ESSCAPE). Hjerppe and Olander's sys- 
tem asked questions relevant to access 
points and the cataloger answered them. 



This interaction resulted in the system 
suggesting the AACR2 rule number ap- 
propriate for given situations. Hjerppe 
and Olander eventually abandoned fur- 
ther development of this system because 
they concluded that AACR2 alone was 
not sufficient to allow for the automatic 
decision making in the selecting of access 
points. They found that human interpre- 
tation was essential because the catalog- 
ing rules were neither self-contained nor 
clearly formulated. 

Evidently, the experience of catalog- 
ers enables them to develop more specific 
rules to given situations than are provided 
by the cataloging code for decision mak- 
ing. Varied experience can lead to differ- 
ences in the interpretation of the infor- 
mation not provided for by the cataloging 
rules. Different or inconsistent infer- 
ences for missing information can result 
in inconsistent catalog records. 

Gibb and Sharif (1988) also created an 
expert system to assist catalogers in creat- 
ing full catalog records. By following 
menu options in the system and using 
"yes/no" and numbered decision alterna- 
tives, a cataloger was led to the appropri- 
ate AACR2 rule. Based on their experi- 
ence, Gibb and Sharif commented that 
using only AACR2 as a knowledge source 
was appropriate. This would allow the 
lengthy process of knowledge acquisition 
from experts to be bypassed and avoid 
personal bias. Gibb and Sharif suggested 
that the system was particularly useful for 
training new catalogers. 

Gibb and Sharif's optimistic conclu- 
sions contradict those of Hjerppe and 
Olander. This difference may be due to 
different expectations. Hjerppe and 
Olander were looking for a system that 
used AACR2 to generate correct access 
points for the variety of items encoun- 
tered in a real environment, and con- 
cluded that AACR2 was insufficient for 
this purpose. Gibb and Sharif, on the 
other hand, developed a system for the ac- 
curate use of AACR2 and were in general 
less concerned with cases in which those 
rules did not give sufficient guidance. 

If cataloging is so complex, it seems 
reasonable to expect that a successful sys- 
tem might better be limited to addressing 
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specif ic document types. This would likely 
result in fewer rules being required to make 
the system operational. Ercegovac (1990) 
and Ercegovac and Borko (1992) fol- 
lowed this approach and created a 
semi-automatic cataloging advisor to as- 
sist catalogers in the cataloging of maps. 
Their system, "Mapper," helped catalog- 
ers establish main entry, title statement, 
statement of responsibility, publisher, 
place, and year of publication. Their ex- 
pert system differed from previous efforts 
because the developers incorporated 
principles from: 

• the system's performance testing; 

• user interface design; 

• diff erent sources of knowledge for the 
development of the knowledge base; 
and 

• knowledge elicitation methods to gain 
the necessary knowledge from map 
cataloging experts (Ercegovac 1990). 
Ercegovac's method for system perfor- 
mance evaluation was to examine the re- 
sults (the catalog records) and the user in- 
terface. Catalog records produced 
through Mapper were compared to the 
catalog records for the same maps pre- 
pared by the Library of Congress (LC). 
This standard for comparison was chosen 
because LC records are often treated as 
the U.S. national standard for quality cat- 
aloging. Although tested on only a small 
sample of maps, Mapper performed well 
and its user interface was well accepted by 
its users. Overall, it was perceived as help- 
ful, easy to learn and fast. 

Davies and James (1984) used catalog- 
ing rules and local guidelines as sources of 
expertise while Gibb and Sharif (1988) 
and Hjerppe and Olander (1989) used 
different parts of cataloging rules and, 
very likely, their own personal knowledge, 
although this is not mentioned in their re- 
ports. All commented on the weaknesses 
of cataloging rules and the adjustments 
that would be necessary to implement 
them in the knowledge base of an expert 
system. Ercegovac (1990) showed how to 
overcome these weaknesses with addi- 
tional sources of expertise. First, one 
must observe common features of the 
documents. The printed logos, number 
codes, and the fold of a map can give clues 



to its producer. Second, one must inter- 
view experienced catalogers about their 
understanding of documents and the pro- 
cess of map cataloging. Because the rules 
don't clearly identify the bodies responsi- 
ble for a map, that information was gath- 
ered from catalogers and developed into a 
definition. Additionally, Ercegovac in- 
quired about the experts' approach to cat- 
aloging and the cataloging process. 

One area of cataloging that novice cat- 
alogers must master is knowledge of the 
sequence of appropriate cataloging rules 
to use in particular cases. Each of the 
expert systems guided the cataloger 
through a sequence of questions, suggest- 
ing the appropriate rule but requiring the 
cataloger to provide the necessary biblio- 
graphic data. This cooperation resulted in 
the creation of a partial or complete bib- 
liographic record. Davies (1987) later 
suggested that cataloging systems inter- 
faces could provide assistance with tem- 
plates by interpreting some data across 
different fields of the record. Interfaces 
could also assist by prompting catalogers 
for required data following an established 
pattern of rule sequences. 

It appears obvious from these re- 
search efforts that traditional cataloging 
is successful because it incorporates 
printed standards, national and local poli- 
cies (written and unwritten), and the ex- 
perience and knowledge of the cataloger. 
It is very likely that in order for an expert 
system to be successful it must incorpo- 
rate all these elements. These also are ar- 
eas that novice catalogers need to master 
over time through acquisition of experi- 
ence. Library and information science 
programs can usually provide some in- 
struction on rules and document charac- 
teristics, but rarely are able to provide the 
development of an experience base at a 
level sufficient for quality cataloging. 

It may come as a surprise that the de- 
velopment of expert systems essentially 
ends with these described prototype 
systems. The basic unit of an expert sys- 
tem for cataloging still has not been 
resolved — that is, the structure and orga- 
nization of the cataloging rules appropri- 
ate for computer use. Without that funda- 
mental unit in place, the tedious and 
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expensive process of cataloging knowledge 
acquisition for developing the knowledge 
base seems to lead to no usable product. 

Although Ercegovac (1990) demon- 
strated how an expert system could over- 
come the difficulties of the cataloging 
rules and incorporate appropriate expert 
knowledge, no one has reported that they 
have resumed the development of expert 
cataloging systems. It appears that cata- 
logers and library managers both perceive 
other avenues as easier and more reason- 
able ways to reduce the human ef fort and 
thus the cost of cataloging. Efforts such as 
outsourcing of cataloging and the devel- 
opment of cataloging workstations seem 
to have taken the place of a fully matured 
expert system. 

Recently, researchers and developers 
have focused on the improvement of the 
cataloger's tools, e.g. cataloger's worksta- 
tion (Brisson 1995a ; 1995b). In this ap- 
proach, advances in computer technol- 
ogy — from dedicated terminals to personal 
computers — are predominantly used to 
benefit human work patterns. At first these 
workstations were only able to handle one 
task at a time, but eventually were im- 
proved to allow the use of multiple appli- 
cations simultaneously. The major advan- 
tage of these workstations is that 
catalogers can access electronic versions 
of cataloging resources. Although such 
workstations are very useful and bring im- 
mediate benefits to the catalogers, they 
do not reduce the intellectual effort in the 
cataloging process, which continues to be 
the catalogers' responsibility. 

Cataloging Rules 

While some researchers tried to build 
prototype expert systems and found it dif- 
ficult to implement cataloging rules in 
these systems, others were interested in 
the lunction and structure of the rules. 
This is a narrower yet vital part of a com- 
prehensive cataloging expert system de- 
velopment because it is an essential part 
of a knowledge base. Different chapters 
of the cataloging rules have been studied 
from a variety of perspectives, with the 
common purpose of enhancing their un- 
derstanding and suggesting how the rules 



could be improved in further editions. 

Svenonius and Molto (1990), Davies 
(1992), and Weibel (1992) prepared ex- 
tensive reviews of automated descriptive 
cataloging and related research. They 
point to Sandberg-Fox (1972) as the first 
to work on this topic. Sandberg-Fox 
(1972) compared the Anglo-American 
Cataloguing, Rules (1967) (AACR) to the 
ol der A . L. A . Cataloging Rules for Author 
and Title Entries (1349) for the comput- 
erized assignment of access points. She 
found that both rules posed problems for 
implementation in a computer environ- 
ment. Given the problems that the proto- 
type expert systems encountered and the 
comments of researchers on their diffi- 
culties with implementing AACR2, 
Sandberg-Fox s result suggests that these 
problems were already present in previ- 
ous editions of the cataloging rules. 

Jeng (1991a) analyzed chapter 1 (Gen- 
eral Rules) of the Anglo-American Cata- 
loguing Rules, 2d ed., 1988 revision 
(AACR2R), to identify the definition and 
functions of a rule. For the use of rules in 
cataloging monographs she created condi- 
tion/action pairs for rules from chapters 1 
and 2 (rule 1.0A1, Sources of Information, 
to rule 1.1G4, Items Lacking a Collective 
Tide, and rule 2.0A1, Scope, to rule 2.1G2, 
Relationship Between Statement of Re- 
sponsibility and Works in an Item that 
Lacks a Collective Title) into 77 condi- 
tion/action pairs. This process would be 
necessary to implement cataloging rules in 
the knowledge base of an expert system. 

Further, in a sample of 50 title pages, 
Jeng identified which rules were applica- 
ble and how frequently they would have 
been used. She determined only 13 of the 
77 condition/action pairs were applied to 
every title page in the sample, while more 
than half of the condition/action pairs 
were never applied. For the Rule 1.0C1, 
which was divided into 25 condition/ 
action pairs, only 3 pairs were applicable 
to all the title pages in the study. Although 
the sample of title pages was small, the 
results suggest that the rules may not be 
equally applicable, with some being gen- 
eral and some very specific. We feel that 
for a much larger sample of documents in 
a limited domain it could be anticipated 
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that not all the rules would need to be im- 
plemented, thus making the knowledge 
base oi a system somewhat more manage- 
able. While the findings of some research- 
ers agree with Jeng's, others have found 
the contrary — in some cases more rules 
would be necessary because of the spe- 
cialized role of the rarely used rules. 

M eador and Wittig ( 1 991 ) studie d how 
frequently the rules used for choosing ac- 
cess points in AACR2 (chapter 21), were 
applied to books in chemistry and eco- 
nomics. In their experiment, only 12 of 
143 rules in that chapter were used on a 
sample of 30 items from each discipline. 
The authors concluded that, although the 
core rules were different for each disci- 
pline, the same expert system could work 
for both because: (1) a small number of 
rules was used in total, and (2) rules used 
for economics represented a subset of 
rules used for chemistry. This study, al- 
though on a small sample, indicates that 
some of the rules are general and could be 
used across different disciplines. 

Weiss (1994) reported on a project 
conducted at the National Library of 
Medicine (NLM) intended to create an 
expert system to assist catalogers with 
personal name authority work. Imple- 
menting only AACR2R chapter 22 (Head- 
ings for Persons) proved insufficient. 
Weiss found that to construct personal 
name headings correctly, the expert 
knowledge of experienced catalogers was 
required for the correct translation of 
personal names appearing in documents 
into personal name headings. He con- 
cluded that acquiring diat expert knowl- 
edge is a time-consuming and expensive 
task. One might imply from Weiss that the 
foundation for correct decision making in 
cataloging requires not only a sound 
knowledge of cataloging rules but also an 
extensive experience in the application of 
these rules. 

The results of Jeng's (1991a) and 
Meador and Wittig's (1991) studies point 
in the same direction. Because the entire 
set of cataloging rules is difficult to imple- 
ment in a knowledge base, using only a 
subset of the rules may be a better ap- 
proach. If the rutes used more frequently 
are also the most important rules and ex- 



clusion of other rules does not lead to 
problems, perhaps only that smaller set of 
important rules needs to be implemented 
in an expert system. Weiss's (1994) find- 
ings do not appear to support this conclu- 
sion, however. These contrasting results 
show that using a subset of rules in a cata- 
loging expert system needs to be studied 
more fully. One way to address this prob- 
lem might be to develop a knowledge 
base using the subset of important rules 
to identify problems that arise. Judging 
from the results of prototype expert sys- 
tems, it is clear that there is some subjec- 
tive interpretation of cataloging rules be- 
cause of missing information or 
insufficient guidance in the rules. More- 
over, in the studies cited, the issue of re- 
searcher bias is not addressed, and its ef- 
fect is undetermined. 

Clearly, there are some difficulties 
with the logical structure of cataloging 
rules. Hjerppe and Olander (1989), 
Ercegovac (1990), Jeng (1991a), and 
Weiss (1994) all commented on problems 
with inconsistent, contradictory, insuffi- 
cient, and redundant information con- 
tained in the rules. Fidel and Crandall 
(1988) examined AACR2 from a general- 
ized database approach, using the en- 
tity-relationship model. They catego- 
rized rules into six types: (1) content; (2) 
establishing entities, relationships, or at- 
tributes; (3) authorized sources; (4) do- 
main; (5) format; and (6) access points. 
They demonstrated that the current ar- 
rangement of AACR2 scatters rules of the 
same category in different parts of the 
text and also mixed different categories 
into the same rule. They believed that this 
adds to the confusion of what are and 
what are not conceptual rules. They sug- 
gested that systematic investigation of the 
structure and function of the rules would 
be possible with the entity-relationship 
model. 

Taniguchi (1996) built a prototype ex- 
pert system for analyzing AACR2R. This 
system was designed to analyze the inter- 
nal structure of the rules and the relation- 
ships among them. The first phase of the 
analysis in this system, a manual one, was 
a transformation of cataloging rules from 
natural language into a more formal, 
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structured language that a computer sys- 
tem could process. The next phase, a com- 
puterized approach, involved the normal- 
ization of rules, i.e., converting a rule 
from its still relatively natural, but already 
structured form, into an entirely mathe- 
matical, logical form (condition/action 
pairs). The third phase of Taniguchi's sys- 
tem involved testing of a rule with rule 
templates. These rule templates were 
skeletal schemes that provided the very 
basic structures of rules. They consisted 
of the characteristics, i.e., the variable in- 
formation of condition/action pairs, nec- 
essary for logical functioning and verifica- 
tion of information for all the necessary 
components. Three types of templates 
were developed: (1) for descriptive rules, 
(2) for definition rules, and (3) for organi- 
zation rules. 

It seems reasonable to expect that 
Taniguchi's first phase was rather subjec- 
tive, given that rules can be interpreted 
differently by different people. As a con- 
sequence, the second and subsequent 
phases could then have different results 
as well. Yet, these differences might illus- 
trate the missing information in the rules, 
which might lead to improvements in rule 
structure. 

There are different views on the cate- 
gorization of cataloging rules. While 
Taniguchi's three categories of rules were 
description, definition, and organization, 
Jeng (1991a) listed five categories: defini- 
tion, description, organization, identifica- 
tion of the source of information, and 
transcription. Fidel and Crandall (1988) 
specified six categories: content; estab- 
lishing entities, relationships or attrib- 
utes; authorized sources; domain; format; 
and access points. Taniguchi (1996) ex- 
pressed a need for consensus on this issue 
as a condition for success of any analysis 
and suggested that the proposed system 
of fered an appropriate method that could 
help with developing less ambiguous and 
more consistent rules. 

Although one might argue that human 
reasoning is different from computer pro- 
cessing, the computerization of a process 
off ers a way for people to organize their 
thoughts and produce a more systematic 
assessment of their own work. The system 



may not be perfect or be able to perform 
analysis independent of human experts, 
but it offers a way to gain a deeper and 
more systematic understanding of human 
experts' own reasoning, including their 
strengths and limitations. 

In these studies of AACR2, research- 
ers have indicated how, and why, it is diffi- 
cult to "teach" a computer how to catalog. 
This should come as no surprise to in- 
structors of cataloging, who encounter 
the need to convey document patterns 
and the intricacy of what is stated in the 
rules, and (equally as important) what is 
not stated in the rules to their students. 
Complicating all these instructional chal- 
lenges are the different life experiences 
that the students bring to cataloging pro- 
cess. Much of the "artistry" of teaching 
cataloging seems to lie in teaching how to 
search for clues that provide a basis for 
the creation of an appropriate biblio- 
graphic description. Such clues often are 
found in the combination and context of 
the bibliographic elements in a document 
and the cataloging rules and their inter- 
pretations appropriate to the situation. 
The question arises of how this artistry 
can be formalized in an expert system. 
For the majority of "typical" documents, 
expert systems using knowledge base 
likeiy could worlc independently, while 
for more complicated documents, these 
systems would need to interact with hu- 
man experts. 

Choice and Form of Access Points 

Access points are necessary because they 
offer a way to access catalog records sys- 
tematically and to arrange them for dis- 
play. Thus, both their selection and their 
formatting are essential components of 
existing catalog codes. Access points have 
been a special area of inquiry from two 
perspectives. First, they have formed a 
subset of studies of cataloging rules. Sec- 
ond, access points have been studied with 
a view toward better control of what 
would be one part of a comprehensive ex- 
pert cataloging system. Such a system 
would incorporate automatic reading of 
parts of documents to produce data for 
the creation of access points. 
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Svenonius, Baughman, and Molto 
(1986) reported studies of automatic rec- 
ognition of names and titles on the title 
page. Their purpose was to identify the 
possibilities of simplifying the determina- 
tion of name access points. Their work 
made the assumption that optical charac- 
ter recognition (OCR) was advanced 
enough to identify text on title pages ac- 
curately. They investigated responsibility 
statements in a sample of 400 mono- 
graphs from an academic library. All the 
documents were in English. Author 
names most frequently appeared on title 
pages, preliminaries, and tables of con- 
tents (38% of the 1,310 author names in 
the sample). Tables of contents contained 
the largest share of all names (66% of the 
total 2,536 names). However, the re- 
searchers further found that many of the 
personal and corporate names that ap- 
peared on title pages, preliminaries and 
tables of contents were neither authors 
nor people or bodies whose function was 
related to authorship. 

Svenonius and Molto (1990) investi- 
gated the possibility of automatic recogni- 
tion of type of responsibility from ma- 
chine readable title pages of monographs. 
They also examined whether title page 
data were adequate for identification of 
name access points. The success rate was 
measured by the number of correctly 
identified access points (all names), and 
in terms of precision (the number of cor- 
rectly identified name access points auto- 
matically identified by their algorithm 
compared to the records created by LC or 
NLM), Their system correctly identified 
90% of the names that appeared on the ti- 
tle pages. Of the correctly identified 
names, 95% were also chosen as access 
points in LC and NLM cataloging records. 
Svenonius and Molto estimated that ap- 
proximately 88% of the access points se- 
lected by LC and NLM could be automati- 
cally derived from title page data. 

Molto and Svenonius (1991) also in- 
vestigated automatic recognition of 
names appearing on samples of title pages 
from monographs in an academic library 
and a public library. Their method cor- 
rectly identified 86% of corporate names 
and 85% of personal names. Only 65% of 



the names identified as personal names in 
the sample were also chosen by LC as per- 
sonal name access points. Approximately 
14% of corporate names and 15% of per- 
sonal names were either not recognized 
or were incorrectly recognized. Molto 
and Svenonius suggested that their rules 
for establishing name access points and 
the graphic presentation in the docu- 
ments contributed to the incorrect or 
failed recognition. 

The state of OCR technology at the 
time of these tests could not reliably in- 
terpret text for cataloging purposes. 
Problems arose due to the function of 
names and the artistic design employed in 
the documents. Systems have difficulties 
identifying the function of some authors. 
While illustrators and translators are usu- 
ally clearly credited, the roles of editors, 
compilers, and other contributors can be 
harder to determine. In some cases these 
functions had to be inferred. Electronic 
files used to produce printed texts might 
be more amenable for computer inter- 
pretation. Perhaps cataloging rules need 
to be more flexible to allow the use of this 
new technology. 

Excluding these obstacles, two issues 
emerged from the research on the auto- 
matic recognition of access points. First, 
it was not possible to anticipate all the 
possible personal and corporate names to 
be included in name authority files. 
These files are a necessary component of 
expert systems. Words scanned in a docu- 
ment need to be compared to the names 
in the authority file by an expert system, 
in order to be accepted as a name, verified 
against other names, and accepted as an 
access point. Second, even if it were pos- 
sible to identify all the appropriate names 
correctly and their current form automat- 
ically, the ability to distinguish different 
functions of authorship remains a chal- 
lenge. A mechanism is needed to distin- 
guish among writers, translators, editors, 
and numerous other people who appear 
on the document and are responsible in 
dif ferent ways and degrees for its creation. 

Both of these are major impediments 
to building expert cataloging systems as 
well as a challenge for the everyday 
work of catalogers. The identification 
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and interpretation of bibliographic data 
in a document is the underlying skill. The 
"fuzzy" use of language, which poses such 
difficulties for computer systems, is less 
problematic for people. Catalogers must 
judge the importance of names and titles 
appearing on the document and decide 
which should be chosen as access points 
and which should be disregarded. This is 
where experience plays a crucial role. 
Here the cataloger finds the organization 
and layout of the document helpful, 
whereas an expert cataloging system finds 
them problematic. 

Document Characteristics 

There are linguistic characteristics most 
people use without much thought, but an 
expert system must be programmed to 
recognize them. Many catalogers know 
that terms like "Associate Professor of 
Clinical Medicine" appearing under a 
name is not a title of the book but rather a 
person's affiliation. People familiar with 
American culture know that "Washing- 
ton" can refer to a city, a U.S. state, or a 
personal or corporate name. These peo- 
ple can easily distinguish through context 
which "Washington" is meant. The chal- 
lenge to the expert systems is to interpret 
that context properly. 

Studies of visual and linguistic charac- 
teristics of documents are extremely im- 
portant in building expert systems. Thirty 
years ago, Kilgour (1969) suggested the 
automatic extraction of bibliographic data 
from documents. Researchers of this op- 
eration still encounter difficulties of tech- 
nological limitations similar to those re- 
searching prototype expert systems. 
Although the optical reading of catalog 
cards for their conversion into database 
records has had good results, OCR equip- 
ment is not yet sufficiently developed to 
create catalog records from original docu- 
ments. Two characteristics of documents 
add to the task's difficulty. One is the inde- 
terminate nature of language, which 
poses relatively little problem to speakers 
of the language, but is difficult to formal- 
ize for a computer system. The other is 
the graphic design of documents. Al- 
though design can be diverse, catalogers 



expect that certain types of data will usu- 
ally appear in certain places. Understand- 
ing the visual and linguistic structure of 
documents may, therefore, be necessary 
in the development of cataloging rules, if 
they are to be used effectively in an expert 
system. Rules derived by generaiizing 
characteristics observed in documents 
can ensure more homogeneous interpre- 
tation of data in documents. 

Jeng (1986) suggested that title pages 
tend to have structure, which could be 
used for automated data recognition. She 
defined a "block" (one or more words sep- 
arated from other blocks by blank vertical 
space) as a basic unit of analysis. She 
found that blocks have two characteris- 
tics!: physical appearance and content. 
Physical appearance is described with 
type face and size. Content is the meaning 
of the text. Jeng (1987, 1988) later pre- 
sented a study of linguistic characteristics 
on a sample of 203 title pages in which she 
identified the occurrence of words, 
phrases, and punctuation marks. In her 
sample, an average of 23 words appeared 
on a title page. The most frequent types of 
words used were proper names (36%) and 
common nouns (26%). Other syntactic el- 
ements that appeared were prepositions 
(11%), initials (initials in names or abbre- 
viations) (7%), articles (6%), adjectives 
(5%), conjunctions (4%), and verbs (4%). 
She examined the occurrence of these el- 
ements in different blocks on title pages 
(e.g., titles, other title information, state- 
ment of responsibility, publisher). 

Jeng (1991b) continued her research 
on title pages, concentrating on their vi- 
sual characteristics. On the basis of fre- 
quency of bibliographic elements, she 
proposed a three -level prototype title 
page, with an appropriate location and se- 
quence of bibliographic elements on the 
page. The first level contained the title 
proper, author and publisher informa- 
tion. The second level added other title 
information and place of publication, 
with the third level adding author affilia- 
tion and year of publication. Based on her 
findings, Jeng concluded that because 
cataloging rules were not derived from a 
systematic knowledge of title pages, the 
interpretation of title page information is 
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more dependent upon a catalogers inter- 
pretation and experience. She suggested 
that her research into the composition of 
title pages has implications for both the 
future development of cataloging rules 
and expert cataloging systems. 

Weibel, Oskins, and Vizine-Goetz 
(1989) also investigated the possibility of 
automated recognition of data on the title 
page and tested Jeng's findings (Jeng 
1986, 1988). They simulated OCR tech- 
niques using locally produced electronic 
versions of title pages. This simulated 
OCR technique was necessary because 
contemporary OCR devices were not able 
to perform at the level that the research- 
ers required. Their goal was to identify 
automatically the following data from the 
title page: tide, other title information, 
statement of responsibility, edition state- 
ment, publisher, place, and date of publi- 

sented by a "token," which was a 
space-delimited character string such as 
name or title (what Jeng called a "block"). 
Tokens had attributes representing posi- 
tion, size, typeface, and the case class of 
each character string. Case class reported 
upper- and lower-case characters, num- 
bers, punctuation, symbols, and blank 
spaces. After the system identified the 
data, a confidence level was calculated, 
and the data were assigned to appropriate 
fields. 

In their sample, 45 records of Eng- 
lish-language monographs were used as 
training data to adjust and improve upon 
the theoretic assumptions for the system. 
Forty-six similar records were used to test 
the system. Their results showed that 
identification of publisher, place and date 
of publication was far better than the 
identification of other bibliograpluc ele- 
ments. For these bibliographic data they 
achieved 80% correct identifications. 
This success was attributable to being 
able to compare publication related data 
to external Files. For example, publisher 
names could be compared to an authority 
file of publishers. They found that it was 
much more difficult to identify titles, 
other title information, and statements of 
resp ons i bi I i ty co rrectly. 

It seems likely that an expert system 



for optical recognition of bibliographic 
data will require human verification of 
the results. Catalogers will continue to 
need to master the identification and in- 
terpretation of bibliographic data from 
documents. Places where this informa- 
tion is likely to appear are frequently, but 
not always, prescribed in the rules. How- 
ever, often document designers or pub- 
lishers don't follow standard practices in 
the placement of data. 

Another suggestion derived from the 
development of prototype expert systems 
is the study of catalogers 'perception and 
understanding of document layout. 
Ercegovac (1990) addressed tliis in her 
attempt to identify the characteristics of 
documents for identification of access 
points the producer of the map. The use 
of document characteristics has been 
found to be very useful by catalogers. 
That use, however, has not heen incorpo- 
rated In the cataloging rules. How do stu- 
dents and novice catalogers learn to use 
diese clues? Most likely, experienced in- 
structors and experienced catalogers im- 
part this information on using the clues as 
well as the use of the catalogers general 
knowledge to create a complete catalog 
record. 

Cataloging Expertise 

These expert cataloging system research 
efforts illustrate some of the difficulties of 
capturing the skill and knowledge re- 
quired to catalog. Cataloging has been 
found to be so complex that it needs to be 
studied as a range of different subtasks. 
The details of these varied tasks need to 
be put together as pebbles in the mosaic 
of cataloging expertise. A similar conclu- 
sion was made by Jeng (1992a). She as- 
serted that the understanding of the role 
of cataloging expertise (the general and 
persona] knowledge of expert catalog- 
ers) needs to be explored. In addition, a 
larger knowledge base (with trans- 
formed rules) and better user interfaces 
must be developed. 

Ercegovac (1990) was the first to dis- 
cuss the incorporation of personal knowl- 
edge from expert catalogers and specific 
document characteristics into an expert 
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system for descriptive cataloging. For 
Mapper, she interviewed experts to de- 
vise additional rules for the knowledge 
base to supplement situations in cases 
where AACR2R provided insufficient de- 
tail. She studied the characteristics of 
printed maps and interviewed experts 
about common characteristics of these 
maps in order to create a knowledge base. 

In a later paper, Ercegovac (1992) dis- 
cussed observation techniques she used 
in the development of Mapper. Inter- 
views and surveys were used in addition to 
the careful study of the characteristics of 
maps. The first two approaches were de- 
signed to elicit details about the data nec- 
essary for map cataloging. For instance, 
questions were asked about definitions of 
authorship and publishing data, and a 
ranking of specific responsibility func- 
tions. This enabled her to understand this 
narrow field in detail and successfully imi- 
tate expert catalogers in a computerized 
system. 

Jeng (1992b, 184) defined expertise as 
"the high degree of skills, dexterity, or 
knowledge of a specific subject area." In 
that study, in which she studied cataloging 
expertise and the transfer of that exper- 
tise, she interviewed the head of the cata- 
loging department and three professional 
catalogers at National Agricultural Li- 
brary (NAL). Documentation related to 
the cataloging process and job descrip- 
tions was also gathered. Jeng concluded 
that people doing the lowest ranked tasks 
have the least expertise, that transferring 
expertise is well formalized in NAL, and 
that informal cooperation among catalog- 
ers was also common. Jeng emphasized 
that learning from questions that arose 
during regular workflow is the most com- 
mon method for transferring expertise. 
This method would likely prove to be an 
obstacle in building expert systems. 

Ercegovac (1992) found that a focused 
interview with a detailed schedule was a 
better source of information about the 
cataloging process than verbal reports 
and associated protocol analysis. The fo- 
cused interview is less biased by the weak- 
nesses of human memory. In her work, ex- 
act questions could be asked about 
authorship and the characteristics of spe- 



cific maps. One wonders how difficult it 
would be to answer in an exact way the 
questions about the cognitive processing 
of how and why someone decided to fol- 
low a detail on a map for determining au- 
thorship when that detail is not men- 
tioned in cataloging rules. 

Jeng (1997) found verbal reports and 
protocol analysis for understanding cata- 
loging expertise to be a successful and ap- 
propriate method for acquiring an under- 
standing of cataloging expertise. In that 
project, she studied the kinds of knowl- 
edge and skills catalogers needed, what 
tasks were involved in cataloging, and 
what strategies catalogers used to solve 
specific problems. She used multiple 
methods: (1) a questionnaire, (2) a self- 
administered verbal report of cataloging 
sessions, (3) verbal protocols of sessions 
in which catalogers were asked to "think 
aloud" during the process and the re- 
searcher observed the process, and (4) 
verbal protocols of training sessions in 
which the researcher was taught by a se- 
nior cataloger. Jeng did not present the 
result of this research, but she illustrated 
her research methods with examples of 
the work process of two catalogers. 
Workflow was found to be influenced by 
the workscreen. One catalogers prob- 
lem-solving strategy was to leave a diffi- 
cult decision until the end of the task, 
while the other solved the problem when 
it occurred. Little explanation was pro- 
vided in her report about the cognitive 
process of decision making. The two cata- 
logers used standard cataloging knowl- 
edge for their work and problem solving 
techniques. The report of the catalogers' 
work did not mention the use of knowl- 
edge beyond cataloging rules. 

Ercegovac (1992) and Jeng (1997) sug- 
gest that knowledge of cataloging rules 
and procedures — as well as knowledge of 
cataloging systems — are necessary. Some 
specific knowledge, which is not written in 
cataloging rules or policies but is part of an 
oral tradition among catalogers in the 
same environment, is also necessary. This 
knowledge needs to be supplemented 
with general knowledge. The unwritten 
rules and general knowledge that assist in 
problem solving needs further research. 
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Different strategies of employing this 
knowledge in solving cataloging problems 
are used by different catalogers. 

The different aspects of cataloging 
expertise likely will require different 
research approaches. While self- 
administered reports might be useful in 
studying the mechanics of cataloging pro- 
cedures, observations and teaching ses- 
sions might be more useful for obtaining 
reasons for particular steps in the proce- 
dures. While questionnaires might be 
useful for learning about certain concepts 
in the cataloging process, they are useless 
if catalogers are not aware of those con- 
cepts. Different documents are examined 
and described in bibliographic records 
containing different data. In the same 
way, different kinds of processes and cate- 
gories of knowledge need to be examined 
and described differently. Catalogers 
need to employ their long tradition of cat- 
egorization of data, enriched with related 
disciplines, on the cataloging process to 
develop mechanisms for systematic rep- 
resentation of their expert knowledge. 

Summarized Implications 

In this paper, we have reviewed 25 years 
of work and thinking in the area of the use 
of computers to create cataloging re- 
cords. Particular attention was paid to the 
types of knowledge necessary for these 
systems and to the problems with con- 
tents of the knowledge base. Have we 
found any clues about the missing peb- 
bles for the mosaic of the cataloging ex- 
pertise? We believe we have. Here are 
some clues found in the literature we re- 
viewed: 

• Comprehensive cataloging systems 
(i.e., those that cover all cataloging op- 
erations and all document formats) 
are too complex to develop either in 
research projects or for work environ- 
ment. 

• Reporting successes and failures in 
the knowledge acquisition for these 
prototype systems has been as impor- 
tant for the development of the disci- 
pline as successes and failures in 
actual development and implementa- 
tion of the expert systems themselves. 



• Some cataloging tasks seem to be 
more amenable to automation than 
others. Such tasks seem to have rou- 
tine procedures. Standardization of 
such procedures could aid in the de- 
velopment of expert systems for the 
automatic generation of the content 
of the cataloging records. Such stan- 
dardization would also be beneficial 
for the education of the students and 
new catalogers. 

• AACR2 has been found to be prob- 
lematic primarily because: (1) of the 
inconsistent logical structure of the 
rules; and (2) the information re- 
quired to make cataloging decisions is 
missing in the rules. Difficulties aris- 
ing from those problems are trouble- 
some not only for the expert systems, 
but also for human experts. 

• Characteristics of documents are im- 
portant in developing expert systems 
that could automatically derive biblio- 
graphic data from the document. 
These same characteristics are useful 
to human catalogers. Researchers 
have found some patterns in the char- 
acteristics that would help both. 

• Systems and people are fairly success- 
ful in recognizing personal names. 
The difficulty lies in deciding about 
the role of a particular name in the 
generation and production of the doc- 
ument. 

We wish to add some more reflection 
on each of the six clues for the missing 
pebbles by returning to each individually. 

Initial optimism that theoretical re- 
search in building expert systems would 
help with the complex task of descriptive 
cataloging dissolved quickly under the 
disappointing performance of prototype 
systems. These complex and comprehen- 
sive systems were generally not success- 
ful. Researchers then focused on smaller, 
narrower areas, in which more under- 
standing and less variability existed. This 
resulted in some success. 

Few researchers who built prototype 
expert systems have reported on actually 
eliciting knowledge from human ex- 
perts, but they have noted that rules can 
be appropriately interpreted only by ex- 
pert catalogers. Information contained in 
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cataloging rules has been shown to be in- 
sufficient by themselves for making cata- 
loging decisions. Rather, catalogers base 
many decisions on information acquired 
thorough experience. Because this infor- 
mation is subjective and varies at differ- 
ent levels of expertise, cataloging results 
also vary. This subjectivity may result in 
inconsistent catalog records and calls for 
greater standardization. 

One approach toward improving con- 
sistency could be the standardization of 
cataloging procedures for particular types 
of documents. It seems reasonable to be- 
lieve that some types of documents con- 
sistently require the use of certain proce- 
dures. For example, some items require 
the use of certain sequences of rules to be 
consistently applied. These procedures 
may involve use of some external knowl- 
edge. If these cases could have the exter- 
nal knowledge captured, they might be 
amenable to formalization in a knowledge 
base. Several benefits could result. First, 
more consistent decisions and more ho- 
mogeneous catalog records could be pro- 
vided. Second, new catalogers could 
study and understand in detail all the 
steps and details in the process. A third 
benefit would be the storage and docu- 
mentation of the expertise. 

Problems using AACR2 as the source 
of knowledge have lead to the investiga- 
tion of cataloging rules. The structure of 
rules, their relationships, and the func- 
tion and use of individual rules and chap- 
ters have been studied by a number of re- 
searchers. Each has concluded that 
cataloging rules were not particularly ame- 
nable for immediate computerization but 
could, and should, be made more adapt- 
able for Implementation in a knowledge 
base. This hecomes particularly important 
as the use of computers to enhance pro- 
ductivity and accessibility increases and 
demands to reduce processing costs rise. 

The need to implement cataloging 
rules in a knowledge base is not the only 
reason to improve the rules. It is clear 
from prototype system development that 
there is some subjective interpretation of 
cataloging rules because of missing infor- 
mation. More systematic and consistent 
rules could result in easier work for cata- 



logers, who could spend less effort on in- 
ferring the missing or contradictory infor- 
mation. More consistent catalog records 
could result if there are fewer individual 
decisions. These are also areas that would 
require more emphasis in the education 
of new catalogers. Instruction in the use of 
cataloging rules should be complemented 
with an examination of inconsistencies and 
gaps in the rules. Strategies commonly 
used by expert catalogers could be used to 
illustrate problems and help novices un- 
derstand the kind of external knowledge 
they need to employ in certain situations. 
Library and information science teachers 
might then better prepare students to cat- 
alog in real environments. 

Some researchers have studied the 
characteristics of documents and experi- 
mented with automatic recognition of 
data from the document. In their work, 
they all made the assumption that future 
OCR technology will be able to correctly 
read title pages regardless of graphic de- 
sign variations. Some also suggested that 
the electronic version of the printed title 
page should be accepted as the primary 
source of information. Results in this area 
have been quite encouraging. More than 
half of personal names, corporate names, 
and title access points could be automati- 
cally assigned in the English-language 
monographs that were tested. 

Two major problems exist with the ac- 
curate automatic identification of names: 
(1) names need to be included in the 
knowledge base and (2) types of responsi- 
bilities need to be distinguished. These 
two areas also need to be mastered by be- 
ginning catalogers. While knowledge of 
names mostly comes from the catalogers 
personal knowledge, types of responsibil- 
ities are learned in cataloging courses and 
through experience. Knowledge of which 
names are associated with the creation of 
a document and where they appear in a 
document is one of the basic skills cata- 
logers need to master. It is one closely as- 
sociated with the knowledge of visual 
characteristics of documents. 

Understanding visual document char- 
acteristics might help with cataloging when 
catalogers are not familiar with the culture 
and language of the document, provided 



92/ LRTS • 43(2) • Sauperl and Saye 



that these visual clues are not culturally 
based. Identification of these characteris- 
tics could help catalogers find and interpret 
the necessary bibliographic data. 

The research we have reviewed here 
has covered cataloging expertise and the 
cataloging process. Understanding the 
decision process is as important as under- 
standing sources, rules, or building cata- 
loger-friendly interfaces on workstations. 
Unfortunately, these researchers seem to 
have contributed little to the real working 
environments and procedures of catalog- 
ers. While significant progress has been 
made in the development of cataloging 
workstations, which make multiple cata- 
loging tools available for simultaneous 
consultation on the computer screens, 
they continue to leave the challenging 
and difficult intellectual work entirely to 
the cataloger. 

Conclusions 

From the studies presented one can con- 
clude that catalogers invest significant in- 
tellectual effort in the interpretation of 
cataloging rules and in the interpretation 
of document characteristics. They learn 
from their own experience and from expe- 
rienced colleagues how to master these 
two tasks. The interpretation of the rules 
likely depends both on the general knowl- 
edge and the expertise of the cataloger. 
Expert knowledge comes from formal ed- 
ucation and interaction with colleagues. A 
culture of sharing expertise among col- 
leagues has great value here. The general 
knowledge of catalogers is the knowledge 
that is not directly connected to their pro- 
fession. It is gained tlirough education 
and social interactions in the community. 
Although these aspects were not studied, 
a systematic approach to the individual in- 
terpretations of the rules might result in 
easier work and in more consistent deci- 
sions. Consistency and clarity of catalog 
records is, after all, the main goal of all 
catalogers, as they try to provide the most 
relevant and accurate information about 
sources to the users. 

What must a student interested in en- 
tering the cataloging specialty learn to be- 
gin the journey to becoming an expert cat- 



aloger? The research in prototype expert 
systems has demonstrated that knowl- 
edge of the cataloging rules is not suffi- 
cient. It was shown that there are several 
missing pebbles from the mosaic of cata- 
loging expertise. The ability to interpret 
cataloging rules is one important area. An 
example of this form of interpretation is 
represented to a limited degree by Li- 
brary of Congress Rule Interpretations 
(LCRI). In addition, a high degree of ex- 
perience and common sense has to be 
employed to make the necessary judg- 
ments called for by the rules. 

Another area to be addressed is the 
sequence of the rules in the system. Cer- 
tainly there are many documents that 
follow the same rule pattern. Some of 
these patterns are taught in cataloging 
classes. Others, less common, are only 
mastered with experience, while a few 
always require innovative approaches to 
rule application. 

Cataloging workstations currently uti- 
lize relatively rudimentary knowledge 
technology. For documents, which follow 
certain patterns, these workstations could 
assist by providing templates, such as in- 
ferring the publisher from the ISBN, as 
suggested by Davies (1987). Molto and 
Svenonius (1998) proposed an interface for 
the online version of AACR2R which would 
make the use of this basic tool easier. Per- 
haps the electronic version of AACR2R will 
begin realize this hope. Certainly linking 
the LCRI and AACR2R within these 
workstations would be a useful tool for cata- 
logers. Again, the Catalogers Desktop from 
LC offer some hope for optimism. 

Access points are essential in the effi- 
cient retrieval of documents. The identi- 
fication of names, titles, and functions as 
well as ranking their importance is neces- 
sary. Experiments have demonstrated that 
many access points and their proper head- 
ings for personal names could be automat- 
ically generated. These decision criteria 
do not come easily. Instead they are the re- 
sult of multiple encounters with cataloging 
rules. Some criteria are suggested in 
AACR2R and LCRI, while others are not. 

Essentially, students aspiring to the 
cataloging specialty cannot reasonably be 
expected to be taught all the solutions to 
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problems they may encounter. Rather, 
their instruction, in addition to providing 
the basic building blocks of cataloging 
(i.e., rules), must concentrate on develop- 
ing problem solving skills which will allow 
them to enhance their cataloging experi- 
ence base. 

Catalogers have demonstrated that 
they can, and want to, use computers in 
the best ways possible. At the moment, 
however, some aspects of their work are 
more amenable than others to the em- 
ployment of this tool. This does not mean, 
however, that we should not continue to 
try to understand and formalize the cata- 
loging process further. Knowledge acqui- 
sition methods offer a systematic inquiry 
into the cataloging expertise. A thorough 
understanding of the mental processes in- 
volved in cataloging offers not only a tool 
for developing expert systems, but also for 
rationalization of our own work. System- 
atic processes are easier to learn and the 
transition from novice to an expert is 
therefore smoother. Such approach 
should appeal to the catalogers, who are 
traditionally regarded as systematic orga- 
nizers of materials and information, and 
to all those who are concerned with the 
quality of library catalogs. 

Considering the research and devel- 
opments to date, it is apparent that the ex- 
pert knowledge base continues — -and will 
likely continue — to reside with the cata- 
loged The new tools that might reduce 
the catalogers other burdens are not 
likely to reduce the need for human ex- 
pert intervention in the cataloging pro- 
cess. This expert knowledge continues to 
be the domain of the cataloger, who pro- 
vides the missing pebbles for the mosaic 
of cataloging expertise. 
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Problems in the Establishment of 
Nonunique Chinese Personal 
Headings with Special 
Reference to NACO Guidelines 
and Vendor-Supplied Authority 
Control 

Hsi-chu Bolick 

Current vendor software for authority controlisfoundto generate negative re- 
sults for nonunique Chinese headings in the local catalog. After authority con- 
trol, nonunique Chinese names in the bibliographic records are found 
routinely altered to match authority headings established in the Library of 
Congress (LC) authority file that are unrelated to the Chinese script of the bib- 
liographic record. Headings with standard diacritics in the Wade-Giles 
romanization scheme are the most problematic. The name headings that are 
negatively affected in the authority control services need to be corrected before 
the LC Pinyin conversion project takes place around the year 2000 to prevent 
headings from being distorted further in the Pinyin conversion process. In this 
paper, I examine the reasons why vendor software produces negative results 
for nonunique Chinese names, and suggest measures for Chinese- 
Japanese-Korean (CJK) libraries in North America and vendors who supply 
authority-control services that include CJK data to improve the situation. 
These include vendor software upgrades, modifications to CJK Name Author- 
ity Cooperative procedures, etc. The authority control service at the Univer- 
sity of North Carolina at Chapel Hill is used to illustrate the problem. 



In the fall of 1997, the Library of Con- 
gress (LC) announced to the East Asian 
library community its decision to adopt 
the Pinyin system for the romanization of 
Chinese. The planning for this; adoption 
includes converting LC's current Chinese 



bibliographic data from the present 
Wade -Giles transliteration system to the 
Pinyin system around the year 2000. The 
conversion will use an approach similar to 
the one the National Library of Australia 
took in converting its CJK (Chinese- 
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Japanese-Korean) data in 1996 (Eastlib 
Discussion List 1997). 

While the East Asian library commu- 
nity is planning for the adoption of the 
new standard, I will address some prob- 
lems in the conversion of nonunique per- 
sonal name headings that should be taken 
into consideration in planning for Pinyin 
conversion in order to ensure Chinese 
bibliographic data integrity. 

The use of avendor-supplied authority 
control service at the University of North 
Carolina at Chapel Hill (UNC), under the 
currently available software program, 
generates negative results for nonunique 
Chinese transliterated name headings in 
bibliographic records. Nonunique names 
in bibliographic records routinely are al- 
tered to the authority headings estab- 
lished in the LC authority file that are un- 
related to the bibliographic records (see 
figure 1). Headings with standard dia- 
critics in the Wade-Giles system are the 
most problematic. Although, in a technical 
sense, the retrieval of such headings is not 
affected, the presence of multiple authors 
under one romanized form presents two 
problems: retrieval sets high in recall but 
low in precision; and mispresented names 
in the local catalog that, if not corrected 
manually, will be distorted further in the 
process of Pinyin conversion when it takes 
place in a few years. 

Nonunique Chinese 
Name Headings 

The term "nonunique name" in the LC 
authority file refers to the use of one 
established heading for multiple people 
who cannot be distinguished from one an- 
other by the form of their names alone. 
Nonunique name authority headings 
occur when people share identical names, 
and additional personal data (such as 
birth and death dates or other qualifiers) 
are not available at the time the heading is 
established. It is common to encounter 
this type of heading in LC's authority 
records for all languages. 

Without qualifiers to the personal 
name, every Chinese name transliterated 
under the Wade-Giles scheme is subject to 
becoming a nonunique heading because of 



at least three factors: the nonapplication of 
the tonal value in transliteration practice; 
the use of diacritics in the Wade-Giles 
transliteration scheme; and Name Author- 
ity Cooperative (NACO) authority prac- 
tice and normalization. 

The Nonapplication of the Tonal 
Value in Transliteration Practice 

National standard Chinese is rich in hom- 
onyms, and is a tonal language. By apply- 
ing one of the basic four different tones to 
a Chinese syllable, a variety of unique 
Chinese characters is generated. But in 
North American cataloging practice, ac- 
cording to the current version of the 
ALA-LC romanization tables, the tonal 
values are ignored in transliteration of 
Chinese scripts (Library of Congress 
1991). This practice groups Chinese char- 
acters that are pronounced with the same 
sound but different tones under one 
transliterated form. For example, combi- 
nations of tonal numerical marks 1, 2,3,4- 
and the roman letters ta in ta 1 ta 2 ta 3 ta 4 
represent at least four different Chinese 
characters. In current LC practice, how- 
ever, these four characters are transliter- 
ated into one 'ta' in the authority and bib- 
liographic records. 

Diacritics in the Wade-Giles 
Transliteration Scheme 

Structurally, the Wade-Giles translitera- 
tion scheme doesn't employ a unique 
roman letter to correspond to each indi- 
vidual Chinese phonetic sound. Instead, it 
uses combinations of roman letters and 
diacritics to stand for some distinctive 
phonetic values. There are four 
diacritics — ayn, umlaut, circumflex, and 
breve — in the original Wade-Giles 
scheme (Lu 1995). The modified Wade- 
Giles transliteration scheme currently 
used by LC and East Asian libraries in the 
U.S. uses two diacritics, ayn (') and um- 
laut ("), to indicate aspiration and the 
middle vowel sound. For example, p't'k' 
represent aspiration of p t k, while u is the 
middle vowel sound to be distinguished 
from u. Chinese characters that are trans- 
literated into identical roman letters but 
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LC Authority Record for Chang, Yeu 



ARN: 768061 
Rec stat: c 

► Type: z 
Roman : ■ 
Govt agn: I 



Series: n 


Ser 


num: n 


► 1 


010 


► 2 


040 


► 3 


005 


► 4 


100 10 


► 5 


400 10 


► 6 


400 10 


► 7 


400 10 


► 8 


500 10 


► 9 


670 


► 10 


670 


( a . e . ) *b t 


► 11 


670 


► 12 


670 


► 13 


670 


► 14 


670 


► 15 


670 


► 16 


670 


chuan, 1643 


► 17 


670 


► 18 


670 


► 19 


670 


► 20 


670 


► 21 


670 


► 22 


670 


► 23 


670 


► 24 


670 


► 25 


670 


Y"u) 


3 


► 26 


670 



Enc lvl: 
Mod rec : 
Sub j : 
Geo subd: 
Subdiv tp: 



-R 3 



Source : 

Name use: a 

Subj use: a 

Ser use: b 

Rules: c 1 



Entered: 19820520 

Upd status : a 

Ref status : a 

Auth status: a 

Auth/ref: a 

Name: b 
n 82071276 II 
DLC *C DLC *d DLC *d DLC 
19940428061253.4 3 
Chang, Yu 3 
Zhang, Yu 3 
Zhang, You 3 
Chang, Kung-li 3 
Chou, Heng 3 

[Editor of Y'un Tai-ying lai hung ... ] 3 

Y'un, T.Y. Y'un Tai-ying lai hung ch r u yen lu, 1981 

p. (Chang Y'u) 3 

[Author of Hsiao tang chia] II 

His Hsiao tang chia, 1969: *b t.p. (Chang Y'u) 3 
[Author of Wen i ti jen wu chi ch r i fa] 3 
His Wen i ti jen wu chi chf i fa, 1953: *b t.p. (Chan Y'u) 1 
[Author of Hsin k r an ching pan ... ] 3 
His Hsin k"" an ching pan p r i p r ing cheng pai chiang 
*b caption (Chang Y'u) 3 
[Author of Feng hsia] 3 

His Feng hsia, 1975: *b t.p. (Chang Y'u) 3 
[Joint author of I*" ai-tzu-f an] 3 

Wei, M. T~ ai-tzu-t° an, 1954 (a.e.) *b t.p. (Chang Yu) 3 
[Author of Huan ching mi li] 3 
[Author of Chieh chung] 3 

Her Chieh chung, 1985: *b t.p. (Chang Y"u) 3 

[Author of Chang Y"u hsiao shuo hs'uan] 3 

His Chang Y"u hsiao shuo hs'uan, 1985: *b t.p. (Chang 



Her Huan ching mi li, 1983: 
(Her real name is Chou Heng) 3 



*b t.p. (Chang Y"u) vita 



Bibliographic Record in DRA Database Before Vendor-Supplied 
Authority Control 



DBCN: AOX-3204 



Entered: 04/17/1996 



Modified: 04/17/1996 



Type : a 
Srce: d 



Bib 1; 
111: 



m Enc 1 : I 
Audience : 



Desc : 
Form: 



a Ctry: 
Cont: 



hk Lang: chi Mod: 
Gvt: Cnf: Fst: 



Ind: Fic 


1 Bio: Dat tp: s Dates: 1995 Control: 






005 


;a 


19960410171011.0 $ 






010 


;o 


32376017 $ 






020 


;a 


9624474249 (pbk.) : $c HK$38.00 $ 






040 


;a 


SFR $c SFR $d JTZ $d NOC $ 






066 


;c 


$1 $ 






090 


;a 


PL2837.Y76 $b C48 1995 $ 






092 


;a 


F $ 






100 


i ;6 


880-01 $a Chang, Y u. $ 






245 


10;6 


880-02 $a Chuang feng pan sha / $c Chang Y' 


u chu 


$ 


250 


;6 


880-03 $a Ti 2 pan. $ 






260 


;6 


880-04 $a Hsiang-kang : $b Ch~in + yuan ch~ 


u pan 


she. 






$c 1995. $ 






300 


;a 


207 p. ; $c 17 cm. $ 






490 


i ;6 


880-05 $a Chang Y 'u hs "uan huan hsi lieh ; 


$v 18 


$ 


500 


; a 


Short stories . $ 







Figure 1. Example of Authority Verification Process on a Nonunique Chinese Heading. 

(Continued on next page ) 
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800; 1 ;6 880-06 $a Chang, Y'u. $t Chang Y'u hs'uan huan hsi 

lieh ; $v 18. $ 

880; 1 ;6 100-01/$1 $a l=G!:e. $ 

880; 10;6 245-02/$l $a !We!Lc!66!2 / $c !=Gl:eIUN. $ 
880; ;6 250-03/$l $a !Os2!JH. $ 

880; ;6 260-04/$l $a !a)!H$ : $b ! 49+1QZ ! 3K! JH!Na, $c 995. $ 

880; 1 ;6 490-05/$l $a !=GI :e !K0 !<h 2 li ! 3W ; $v 18 $ 

880; 1 ;6 800-06/$l $a !=G! :e. $t !=G! : e !K0 ! <h 2 li ! 3W ; $v 18. $ 



Bibliographic Record in DRA Database After Vendor-Supplied 
Authority Control 

DBCN: AOX-3204 Entered: 04/17/1996 Modified: 1/06/1997 

Type: a Bib 1: m Enc 1: I Desc: a Ctry: hk Lang: chi Mod: 
Srce: d 111: Audience: Form: Cont: Gvt: Cnf: Fst: 
Ind: Fic: 1 Bio: Dat tp: s Dates: 1995 Control: 

005; ;a 19960410171011.0 $ 
010; ;o 32376017 $ 

020; ;a 9624474249 (pbk.) : $c HK$38.00 $ 
040; ;a SFR $c SFR $d JTZ $d NOC $ 
066; ;c $1 $ 

090; ;a PL2837.Y76 $b C48 1995 $ 
092; ; a F $ 

100; 1 ;6 880-01 $a Chang, Yu. $ 

245; 10;6 880-02 $a Chuang feng pan sha / $c Chang Y'u chu. $ 
250; ;6 880-03 $a Ti 2 pan. $ 

260; ;6 880-04 $a Hsiang-kang : $b CtTin + yuan clTu pan she, 

$c 1995. S 
300; ;a 207 p. ; $c 17 cm. $ 

490; 1 ;6 880-05 $a Chang Yu hs'uan huan hsi lieh ; $v 18 $ 
500; ;a Short stories. $ 

800; 1 ;6 880-06 $a Chang, Yu. $t Chang Y'u hs'uan huan hsi 

lieh ; $v 18. $ 

880; 1 ;6 100-01/$1 $a !=G!:e. $ 

880; 10;6 245-02/$l $a !We!Lcie6!2 / $c !=G!:e!UN. $ 
880; ;6 250-03/$l $a !Os2!JH. $ 

880; ;6 260-04/$l $a !a)!H$ : $b ! 49+1QZ ! 3K! JH!Na, $c 

1995. $ 

880; 1 ;6 490-05/$l $a ! =G ! : e !K0 ! <h 2 li ! 3W ; $v 18 $ 

880; 1 ;6 800-06/$! $a !=G!:e. $t I=G1 ;elK0 Kh 2 lil 3W ; $v 18. $ 

Figure 1. Example of Authority Verification Process on a Nonunique Chinese Heading. 



with different diacritics or without dia- 
critics, s u ch as p 'i an d ;?i ( t 'i an d * t , & o and 
ko, and so on, stand not only for different 
pronunciations within each pair but dif- 
ferent meanings as well. This design in 
the Wade-Giles scheme generates a high 
volume of unrelated Chinese characters 
that have one identical mm an form ex- 
cept for the distinguishing diacritic; J 
marks; for instance, ch'un, chiXn, chtin; 
ch uan and chuan\ yu and yu, etc. 

NACO Authority Practice and 
Normalization 

Authority records contributed to the LC 
authority file are created by LC and 



NACO participating institutions. When 
creating authority records for the LC au- 
thority (lie, NACO catalogers must follow 
a set of LC guidelines to ensure the integ- 
rity and quality of the large shared file. 
Rules ol normalization are one set of the 
guidelines that catalogers apply to avoid 
creating duplicate headings. 

Normalization is a process that is de> 
signed to eliminate all but the essential 
ch aracters of a heading for the purpose of 
comparison (Cataloged Desktop 1998). 
Normalization programs do this by re- 
moving all diacritics and punctuation and 
converting all letters to upper case (see 
figure 2). 

Under this normalization rule and 
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Catalog form: 
Normal i zed f orm : 



Ile-de 
ILE DE 



■Montreal 
MONTREAL 



(Quebec) 
QUEBEC 



Figure 2. Example of Normalized Heading. 

NACO guidelines, a new heading is cre- 
ated only when the normalized form does 
not match the normalized form of an al- 
ready established heading in the LC au- 
thority file. If its normalized form is found 
in the authority file, the heading is consid- 
ered established. When applying normal- 
ization to Chinese transliterated names, 
diacritics, which are essential for distin- 
guishing Chinese syllables under the 
Wade-Giles system, are considered not 
essential and therefore are eliminated 
from the heading comparison process. If 
headings are found normalized to the 
same form, they are put together in one 
authority record (see figure 3). 

Under NACO practice, the Chinese 
names in example 1 of figure 3 share one 
authority record. The authorized heading 
for these two names will either be Cheng, 
Ch'en, or Ch'eng, Chen, depending upon 
which name gets created first in the au- 
thority file. Likewise, the Chinese names 
in example 2 of figure 3 have one author- 
ity record, and the established heading 
takes the diacritics associated with the 
name first created in the authority file. 

When dealing with a new nonunique 
name whose normalized form matches an 
established heading in the authority file, 
NACO catalogers do nothing to the estab- 
lished heading but revise the existing au- 
thority record in the note area (670 fields in 
the authority record) to include new infor- 
mation on additional names and 
their related citations. New ref- 
erence headings for different 
names are added to the author- 
ity record when appropriate. If 
the existing authority record is 
coded as a unique name, it is 
changed to a nonunique name 
record when new names are 
found that share the same nor- 
malized form. 

For instance, in the above 
examples, when the new name 
Cheng, Ch'en appears in the 
cataloging source and Ch'eng, 
Chen is found already estab- 



lished as an authorized 
heading in the LC author- 
ity file, Cheng, Ch'en is 
only added to the note 
area. Ch'eng, Chen remains as the estab- 
lished heading for this record. Cross- 
references related to Cheng, Ch'en can 
be added to the record as long as they do 
not normalize to the established form. 

If unaware of this practice for non- 
unique names, one is likely to interpret 
Ch'eng, Chen as it appears in the autho- 
rized heading field of the authority record, 
as the authoritative form for Cheng, Ch'en. 
Likewise, one might take Chang, Ch'iian, if 
established first, as the established author- 
ity heading for Chang, Chuan and Chang, 
Chuan (see figures 1 and 4). 

Cataloging for Nonunique 
Chinese Names 

As a rule, catalogers choose LC- 
established headings over name forms 
found in the cataloging source for head- 
ing forms in bibliographic records to 
control quality and uniformity of access 
points; yet in creating bibliographic head- 
ing forms for Chinese nonunique names, 
this is not always the case. In the example 
of the heading Ch'eng, Chen, when the 
new name Cheng, Ch'en appears in the 
cataloging source, for which a nonunique 
authority heading is found established 
under Ch'eng, Chen, instead of using the 
established heading Ch'eng, Chen, the 
cataloger enters Cheng, Ch 'en as the au- 
thor's name in the bibliographic record to 



1. 

a . 


Catalog form: 


Cheng, 


Ch'en 




Normalized form: 


CHENG, 


CHEN 


b. 


Catalog form: 


Ch'eng, 


Chen 




Normalized form: 


CHENG, 


CHEN 


2. 

a . 


Catalog form: 


Chang , 


Ch 'flan 




Normalized form: 


CHANG, 


CHUAN 


b. 


Catalog form: 


Ch 'ang. 


Chuan 




Normalized form: 


CHANG, 


CHUAN 


c . 


Catalog form: 


Chang, 


Ch 'uan 




Normalized form: 


CHANG, 


CHUAN 



Figure 3. Normalization in Chinese Name Headings. 
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ARN: 


681573 










Rec 


stat: 


Q 


Entered: 19820208 






V 


Type 




2 


Upd status : a Enc lvl : n 




Source: 




Roman : 


g 


Ref status: a Mod rec: 




Name use: a 




Govt 


agn: 




Auth status : a Sub j : a 




Subj use: a 




Series : 


n 


Auth/ref: a Geo subd: n 




Ser use: b 




Ser 


num: 


n 


Name: a Subdiv tp: ■ 




Rules : c 1 


r 


1 


010 




n 81136731 11 






r 


2 


040 




DLC *c DLC 11 






V 

r 


3 


005 




19840912155140.5 1 






r 


4 


100 


10 


Y'u, Yu-jen, *d 1879-1964. 1 






► 


5 


400 


10 


Y"u, Yu-jen, *d 1878-1964 *w nnaa 1 






► 


6 


400 


10 


Yu, Youren, *d 1879-1964 11 






► 


7 


400 


10 


U, Y'ujin, *d 1879-1964 11 






► 


8 


670 




His Yu-jen shih ts f un, 1956. 11 






► 


9 


670 




Liu, Y.T. Min kuo Y'u Yu-jen hsien 


sheng nien p c u, 










1981: *b p. 1 (Y'u Yu-jen, b. 1879) 


1 




► 


10 


670 




Biog. diet, of Repub. China, 1971: 
(Y'u Yu-jen, 4-11-1879-11-10-1964) 


*b 
1 


v. 4, p. 74 



Figure 4. The Unique Name Authority Record (See Name: a in the Fixed Field) 



reflect the Chinese characters. 

This practice is necessary in cataloging 
Chinese materials for the purpose of 
appropriately matching roman data to the 
corresponding vernacular data in the 
MAchine-Readable Cataloging (MARC) 
record. The current negative authority 
control result found on nonunique Chi- 
nese names occurs because available ven- 
dor software does not take into account 
this special cataloging practice for non- 
unique Chinese names. 

Vendor-Supplied Authority 
Control Services 

In 1995, three vendors, Library Technol- 
ogy Inc. (LTI), Blackwell North America 
(BNA) (now OCLC Authority Control 
Services), and the Washington Libraries 
Network (WLN), were invited to UNC to 
demonstrate their authority services. 
During their presentations, LTI, BNA, 
and WLN displayed similar approaches 
and software designs in processing au- 
thority control for local files. 

LTI was chosen to provide vendor 
authority control for UNC. Its procedure 
for authority control is as follows (LTI 
1998): 

First, headings are "normalized" for spac- 
ing and punctuation to increase the proba- 
bility of a link between a catalog record 
heading and an authority record heading, 
Next, the normalized headings are com- 
pared against a comprehensive index of 



authorized and variant-form headings gen- 
erated from authority records. When a 
match occurs, a link is created between the 
authority record heading and the catalog 
record heading. If the catalog record head- 
ing matches a see reference in an authority 
record, the catalog record heading is re- 
placed by the heading in the LC authority 
record. 

Existing authority control software 
lacks special instructions that would 
account for the special cataloging prac- 
tice for nonunique Chinese names. All 
nonunique Chinese names in biblio- 
graphic records that are normalized in the 
same way as authority headings (but that 
dif fer from them in diacritics) are treated 
as incorrect forms, and thus are changed 
to authority headings. As a result, after 
vendor authority control processing, one 
finds the roman data doesn't match the 
vernacular data for nonunique names in 
the bibliographic record. 

Taking again the example of Ch'eng, 
Chen, under the current vendor sof tware 
process, because Ch'eng, Chen is the 
established heading, all occurrences of 
Cheng, Ch'en will be changed to the 
authority heading Ch'eng, Chen in the 
catalog after the authority control service. 

LTI, BNA, and WLN all agreed that 
they have this problem in their services . It 
appears that on the current market there 
are no available commercial vendors who 
offer authority control services that take 
this situation into consideration. 
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Wade-Giles, Pinyin Schemes, 
and Normalization 

Normalization, the current industry stan- 
dard for database management — which 
by design aims for effective data storage 
and management — requires that the rela- 
tionship between the related elements 
(the primary key and the data) is one- 
to-one or many-to-one, but never one- 
to-many or many-to-many (Dutka 1989). 
That is to say between personal names 
and authority records, normalization 
functions effectively when one authority 
record relates to one unique personal 
name, or one authority record stands for 
one unique person with multiple names, 
but not vice versa when multiple unique 
personal names are represented by one 
authority record. 

The application of the Wade-Giles 
scheme in current cataloging and NACO 
practice for nonunique names shows a 
breakdown with the principle of normal- 
ization. Diacritics used in the Wade-Giles 
scheme to represent the uniqueness of 
sounds and words are considered not 
essential in the existing normalization 
rules and thus are excluded from the pro- 
cess. This explains why procedures that 
occur after the normalization process in 
authority-control services cannot process 
nonunique Chinese transliterated names 
properly. 

The Wade-Giles scheme will soon be 
dropped for cataloging work, and the neg- 
ative effect associated with it under nor- 
malization will be eased substantially 
when the new Pinyin scheme is adopted. 
Nevertheless, it will not disappear. 
Compared to the Wade-Giles scheme, the 
Pinyin scheme reduces the use of diacritics 
down to the umlaut for distinguishing cer- 
tain unique Chinese characters. For in- 
stance, the Chinese character "green" is 
transliterated in Pinyin as lii, while the Chi- 
nese character "furnace, stove" is transhter- 
ated in Pinyin as lu. Under current normal- 
ization, lii and lu again appear as the same 
form. Therefore without software up- 
grades or modification in the translitera- 
tion scheme, the nonunique name syn- 
drome in the authority control process will 
continue to happen with Pinyin data. 



Pinyin Conversion 

Before the year 2000 (when LC will start 
converting its database to the Pinyin 
scheme), libraries preparing to follow LC 
in converting their local files to Pinyin not 
only should undertake correction of all 
nonunique Chinese names that are af- 
fected negatively by the authority control 
process as described above but also 
should work on correcting diacritics in 
the bibliographic records to prevent data 
distortion in the conversion process. 

The National Library of Australia 
(NLA) in 1996 successfully converted its 
CJK data from the Wade-Giles scheme to 
the Pinyin system with computer technol- 
ogy. After closely observing the process 
and the conversion result at NLA, LC an- 
nounced that its Pinyin conversion pro- 
ject would be carried out using the same 
approach as NLA's. 

Describing the CJK conversion pro- 
ject in detail, Groom (1997) wrote that 
one key element in designing the software 
program for Australia's Pinyin conversion 
project was the setting up of a conversion 
table that listed and correlated all valid 
Wade-Giles words with their Pinyin coun- 
terparts. With the conversion table in 
place, the conversion program was thus 
able to identify and subsequently convert 
all Wade-Giles words to their Pinyin 
counterparts when run against the local 
bibliographic file. 

The conversion table was written pre- 
cisely based on the Wade-Giles rules to 
map each Chinese character to its Pinyin 
counterpart, taking into account the exact 
use of Wade-Giles diacritics and their po- 
sition in the syllables, etc. The conversion 
program carries out the conversion only 
when the bibliographic data match cor- 
rectly to the conversion table. This is to 
say local bibliographic records submitted 
for conversion are required to have the 
same Wade-Giles forms and diacritics as 
the conversion table to guarantee accu- 
rate data conversion. Any spelling errors 
in roman forms or misused diacritics will 
cause incorrect conversion or no conver- 
sion at all. 

Groom (1997) reported that conver- 
sion errors were found when records con- 
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Suggestions 

The inadequacy of vendor-supplied au- 
thority control of Chinese name headings 
demonstrates an immediate need for the 
East Asian library community to find so- 
lutions for the incompatibility between 
the normalization rules governing the 
setup of Chinese personal headings in the 
NACO program and the design of the 
standard transliteration scheme used in 
North America for transcribing Chinese. 
Through examination it appears that a 
direct method for resolving such incom- 
patibility is simply to make all Chinese 
personal headings unique iti the LC au- 
thority file. Use of unique headings not 
only allows Chinese names to be pro- 
cessed correctly under normalization 

As a means of doing this and solving the 
current authority control problem, sev- 
eral approaches are possible: improving 
and developing new system software; 
completing NACO headings with dates; 
providing clear instructions regarding the 
use of Roman forms for authorized head- 
ings; and eliminating diacritics from the 
Pinyin scheme. 

Improving and Developing New 
System Software 

Before a permanent way is found to deal 
effectively with romanized Chinese 
names in the LC authority file, in the 
short term we need to request that a new 
algorithm be developed immediately in 
vendor software that can halt the incor- 
rect replacement of nonunique Chinese 
names in the bibliographic records with 
authority heading forms. One way of up- 
dating the current vendor software for 



this purpose is to incorporate the NA\. 
element in the fixed field of the 
USMARC Authority Format record into 
the new algorithm so that the software 
will perform global replacement of cata- 
log forms only to those headings that are 
linked to unique name records. Headings 
that are linked to nonunique authority re- 
cords should be marked and stored in a 
file for further processing. The review of 
unprocessed nonunique names should 
not be a heavy burden for catalog mainte- 
nance because most of these names delib- 
erately are kept in those forms by human 
decision at the time of cataloging. Thus, 
the concern over the error rate for those 
names is small. 

In the longer term, software must be 
developed that incorporates vernacular 
data in the normalization process. Adding 
vernaculars to authority records has been 
advocated by the East Asian library com- 
munity to solve problems of differentiat- 
ing Chinese names given the high volume 
of homonyms found in transliterated data 
in authority records (Yu 1996; Morimoto 
1996). 

To increase the uniqueness of Chi- 
nese names in the LC authority file, the 
new software should not only have the 
capability of storing and displaying ver- 
nacular data, but should also include a 
mechanism that takes vernacular data 
into consideration in the normalization 
process. That is, an additional procedure 
should be devised that would involve 
comparing vernacular data for headings 
that are identical when normalized. 
Headings that would otherwise be con- 
flated could be kept separate when the 
vernacular data do not match. Authority 
control software would follow the same 
design for heading comparison and cor- 
rection. This extra procedure would sub- 
stantially improve effectiveness in con- 
trolling Chinese names and prevent false 
conversions during the authority control 
process. 

A more cooperative effort must be 
made through the NACO program to 
improve Chinese name heading quality. 
Before any new updates are developed in 
the software, NACO catalogers and the 
NACO program can reduce or prevent 



tained unexpected diacritics. In some 
cases, diacritics were not input as pre- 
scribed in the Wade-Giles scheme, with 
the result that no conversion took place 
because a diacritic discrepancy occurred 
between the catalog entries and the con- 
version table. In other cases, diacritics 
were omitted in the NLA catalog entry, 
resulting in an erroneous conversion to 
another heading. 
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the normalization impact to Chinese 
names in the meantime by making the 
heading unique in ways such as: complet- 
ing NACO headings with dates; providing 
clear instructions regarding the use of ro- 
man forms for authorized headings; and 
modifying the Pinyin scheme to eliminate 
the use of diacritics. 

Completing NACO Headings 
with Dates 

Adding birth and death dates to Chinese 
romanized names is very effective for es- 
tablishing unique headings in the LC au- 
thority file. All NACO catalogers should 
be encouraged or required to provide 
such data to Chinese headings when avail- 
able. To make this a common and practi- 
cal practice in NACO work, not only 
should the NACO program be expanded 
to include more East Asian catalogers to 
share the burden and expense of such 
authority work, but a global effort should 
be made to solicit information from li- 
braries, publishers that can provide such 
information through Internet, or author 
introductions in published books (Eastlib 
Discussion List 1998). 

Providing Clear Instructions 
Regarding the Use of Roman 
Forms for Authorized Headings 

Instructions must be provided about 
whether to use roman forms that appear in 
the source or author-supplied roman 
forms as authoritative forms instead of us- 
ing systematic transliterated forms from 
the Chinese scripts. The Anglo-American 
Cataloguing Rules, 2d ed., 1988 revision 
rule 22.3C2, and its alternative rule, which 
direct the setup of Chinese headings in 
NACO practice are insufficient for cata- 
logers to handle the increasingly common 
situation where, in the cataloging source, 
the romanized form of a name and its Chi- 
nese script coexist on the title page. 

Due to a lack of guidelines in the 
NACO procedures for such circum- 
stances, the choice of forms for these 
names varies among catalogers. Some 
headings are established under 22.3C2 
and its alternative rule, using the most 



popular form of authors as known by 
readers and in reference sources. For 
some headings lacking reference sources 
or guidance in the NACO procedure, it is 
dif fie ui t to co m p ly fully with chapter 22 to 
establish unique Chinese headings, and 
catalogers refer to Rule LO for guidance, 
which prescribes transcribing the state- 
ment of responsibility of a work in the lan- 
guage and script in which it appears. As a 
result, the former rule usually brings the 
roman forms already supplied by the 
source as authority headings, while the 
latter rule results in headings that are 
systematically transliterated from the 
original scripts. 

Roman forms systematically transliter- 
ated from Chinese scripts, as discussed 
earlier, frequently result in a high volume 
of homonyms, thus producing nonunique 
headings. If NACO guidelines clearly pre- 
scribe 22.3C2 and its alternative to take 
precedence over rule 1.0E in establishing 
authority headings in such situations, 
more unique name headings will result. 

The author or publisher-supplied ro- 
man forms appearing in the source not 
only indicate the authors' choice of forms 
in their names, but are usually formulated 
less systematically and predictably than 
those generated according to the translit- 
eration scheme. Therefore, they are less 
likely to become nonunique names in the 
authority file than names formulated by 
the transliteration scheme. Such practices 
also reduce the confusion of public catalog 
users who recognize authors' names in 
their roman forms from publications 
rather than the forms from the prescribed 
transliteration scheme. Figure 5 displays 
the difference between names transliter- 
ated with the Wade-Giles scheme and 
those supplied by authors or publishers. 

Under the Wade-Giles scheme, Tung 
Chien-hua represents hundreds of differ- 
ent Chinese characters for names, but the 
combination of Ttmg Chee Hwa together, 
so far, only represents one personal name. 
Likewise, Ng Chi-sum, formulated under 
the Cantonese pronunciation, is drasti- 
cally different from Wu Chih-sen, formu- 
lated under the standard Mandarin sound 
on which most transliteration schemes for 
Chinese are based. 
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Eliminating Diacritics 
from the plnyin scheme 

The Pinyin scheme will no doubt work 
much better in normalization for Chinese 
name headings than the current Wade- 
Giles scheme. Not only does Pinyin allow 
each sound to be associated with a 
unique roman symbol, but it also uses 
fewer diacritics. The use of diacritics to 
distinguish a pair of vowels or aspiration 
in Chinese has made Wade-Giles very in- 
effective in normalization. Thus, the only 
diacritic introduced for use in the incom- 
ing Pinyin scheme — the umlaut — would 
need to be eliminated and replaced with 
an alternative expression in the scheme. 
Repeating the roman letter that the um- 
laut attaches to is one alternative to repre- 
senting the sound with umlaut. For in- 
stance, lil would be presented as luu. In 
the pair of words lu (meaning "green" in 
Chinese) and lu (meaning "stove" in Chi- 
nese) can thus be distinguished as luu and 
lu in normalized form. This practice is 
common in publications and in other 
transliteration schemes used in Taiwan, 
Japan, and in the Macintosh Chinese lan- 
guage kit, for example . To allow the access 
of Pinyin headings where standard um- 
lauts are used, Pinyin entries with stan- 
dard umlauts should be entered as cross- 
references in authority records. 

Data processing for nonroman lan- 
guages, especially Chinese, in areas of li- 
brary services is by no means easy. But 



with careful design in software to incor- 
porate Chinese vernacular data, adjust- 
ment to transliteration schemes, and co- 
operation among NACO catalogers to set 
up unique personal headings, I am confi- 
dent that Chinese names can be pro- 
cessed as effectively as names in other 
languages in the online environment. 
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Classifying Newspapers Using Dewey 
Decimal Classification 



Tamara J. Kuhn 



Librarians facing the decision of whether or not to classify a newspaper col- 
lection must consider the basic research needs of patrons. Users primarily ac- 
cess newspapers by geographic location (more specifically, by city of 
publication ), and by date. Librarians also must consider that newspapers are 
serials and that classifying newspapers presents the same challenges as clas- 
sifying general serials. Multiple title changes and unique relationships 
among newspapers can complicate the application of a classification system 
to a newspaper collection. Keeping newspaper families together chronologi- 
cally on shelves is important when locating titles. The Dewey Decimal Classi- 
fication system, with a few modifications, can be used optimally to classify a 
newspaper collection. Knowing that classification canfunction as an efficient 
organization methodfor a newspaper collection makes a library's decision of 
whether or not to classify easier. 



JL he Illinois Newspaper Project (INP) 
moved to the University of Illinois at 
Urbana-Champaign (UIUC) in January 
1996 to begin cataloging the library's 
United States newspaper collection. As a 
part of the United States Newspaper 
Program (USNP) funded by the National 
Endowment for the Humanities, INP 
shifted its focus from cataloging the exten- 
sive Illinois newspaper collection at the 
Illinois State Historical Library in Spring- 
field, Illinois, to cataloging a much broader 
United States collection in one of the larg- 
est academic libraries in the nation. 

One major obstacle discovered while 
evaluating the UIUC newspaper collec- 
tion was theneed to classify newspapers as 
well as provide description and subject 



analysis. Existing rules and guidelines 
provided by USNP and CONSER gov- 
erned the latter two elements, but classi- 
fying newspapers was an entirely new 
realm. Because INP is only cataloging 
United States newspapers, I will focus on 
classifying U.S. titles. 

Newspapers as Serials 

Newspapers are a specific form of serial, 
and the question of whether or not to clas- 
sify a serials collection applies directly to 
a newspaper collection. Much has been 
written on classifying serials and periodi- 
cals, and authors debate the pros and cons 
of arranging a serials collection in call 
number order. In an article discussing 
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whether or not to classify serials, Smith 
(1978, 372) wrote, "Classification is in- 
deed a very basic library approach to ma- 
terials, and it cannot be overlooked. Seri- 
als collections and their use are much too 
important to be given the short shrift. 
Classification is imperative." Newspapers 
need to be given the same consideration 
in library collections as other materials, so 
a library's decision to classify serials 
should also include newspapers. 

Smith (1978) argued that serials 
should be classified for seven reasons, 
many of which are still relevant today: 

1. Materials on the same subject should 
be kept together, regardless of type, 
to facilitate subject access and brows- 
ing. 

2. Classification encourages broader 
use of serials by treating them like 
other materials. 

3. Serials reference work will be aided 
by classification. 

4. Sorting call numbers can more easily 
produce subject aids and guides. 

5. Reshelving becomes a mechanical 
task, and shelvers are not required to 
know special organization rules. 

6. Materials that have undergone title 
changes can remain on the shelves to- 
gether. 

7. Classification can help prevent over- 
crowding in the subject catalog due to 
successive entry cataloging. 

Bross (1990) listed some benefits of 
classifying a periodical collection (178): 
"A classification system enables a library 
to keep volumes of the same periodical to- 
gether on the shelves in spite of title 
changes." Newspapers are notorious for 
changing titles many times within a life- 
span, and using a classification system to 
organize a newspaper collection is one 
way to manage title changes on shelves 
and in local catalogs. Bross also stated 
(178) that "another benefit is the assis- 
tance a classification number can give 
staff/patrons who may be faced with the 
problems of correlating catalog entries, 
spine titles, filing rules, and new catalog- 
ing codes." Newspapers often merge to 
become new bibliographic entities or 
combine in various ways to form different 
manifestations of an earlier title. Clas- 



sifying newspapers can help staff and pa- 
trons navigate through shelves without 
having constantly to consult the catalog 
for exact titles and title changes. Finding 
newspapers becomes a matter of follow- 
ing a base call number instead of remem- 
bering all of the various titles within a 
newspaper's lifespan. 

Classifying Newspapers at UIUC 

Patrons access newspapers primarily by 
place of publication. Many times a re- 
searcher does not have a specific newspa- 
per title in mind to fill an information 
need, but instead knows that a newspaper 
from a given area likely will contain the 
needed information. Historians, for ex- 
ample, can utilize newspapers from dif- 
ferent geographic locations to compare 
how an event was covered at the time that 
it occurred. Location and date are most 
important in this type of reference re- 
quest. Basing classification on geographic 
location can play a vital role in organizing 
and accessing a newspaper collection to 
expedite a researcher's information need. 
Arranging a newspaper collection geo- 
graphically is logical in relation to the 
manner in which newspapers are re- 
quested by patrons. 

In many catalogs, the call number 
serves as an additional access point. When 
accessing the UIUC newspaper collection, 
searching the catalog by call number is a 
very important function because most ma- 
terials are housed in closed stacks areas. 
Patrons who know a general call number 
can search that number and browse the 
catalog by shelf arrangement. This serves 
them as if they were browsing the shelves 
in search of other newspaper titles from 
the same geographic area. In collections 
that allow patrons to retrieve their own 
materials, arranging newspapers geo- 
graphically expressed through classifica- 
tion will fulfill a basic research need. 

Organizing title changes through clas- 
sification facilitates browsing a newspa- 
per collection. Patrons can find the classi- 
fication number of a title in the middle of 
a newspaper family and use that number 
as an entry point to the shelves. Once 
patrons locate that call number on the 
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shelves, they can easily move from title to 
title within that family without having to 
know the intricacies of title changes. 

The decision to classify the newspaper 
collection at the UIUC library was com- 
pounded by the mechanics of the local 
online catalog. The Full Bibliographic 
Record/Library Computer System (FBR/ 
LCS), the online catalog that UIUC used 
at the time this article was written, re- 
quired a call number to function properly. 
The catalog consisted of two separate da- 
tabases and software programs; one for 
bibliographic records (FBR) and one for 
holdings records (LCS). The linking com- 
ponent in this system was the classifica- 
tion number, and without one, patrons 
could not move easily between biblio- 
graphic and holdings records. Because of 
this (in addition to the need for geo- 
graphic arrangement), it became clear 
that classification was essential in provid- 
ing local access to the UIUC newspaper 
collection. 



Using the Dewey Decimal 
Classification System 

Librarians at the UIUC library classify the 
genera] collection using Dewey Decimal 
Classification (DDC), and the newspaper 
collection needed to fit into this scheme. 
Before evaluating the classification 
scheme in relation to organizing newspa- 
pers^ literature search revealed that only 
one article had been written concerning 
newspaper classification, and it dealt only 
with Library of Congress classification 
(Krissiep 1991 ). There were no previously 
published articles on using DDC to clas- 
sify a newspaper collection to help guide 
the INP. Few articles have been written 
concerning newspaper classification be- 
cause newspaper cataloging is relatively 
new to librarianship and few newspaper 
collections are currently classified. 

The USNP, formalized in 1984, was the 
first coordinated nationwide eff ort to cata- 
log newspaper collections. Rules specific 
to newspaper description and subject anal- 
ysis were established to facilitate this pro- 
gram (Field 1986). Classification, how- 
ever, was not a formal part of the USNP. 

Before examining DDC's application 



to newspapers, it was important to look at 
the existing organization of the UIUC 
newspaper collection. Like many news- 
paper collections, UIUC's collection was 
arranged primarily by geographic loca- 
tion. The first level of geographic organi- 
zation was by country. Then the newspa- 
pers were organized alphabetically by 
state, and then alphabetically by city 
within each state. The next element of or- 
ganization was by title. A common title, 
which is a shortened form in popular use 
by the public that does not reflect title 
changes, had been assigned to most news- 
papers and alphabetized within each city. 
For example, the Chicago Sun-Times un- 
derwent six title changes (including one 
merger) between 1941 and the present; 
however, the paper is commonly known 
as the Chicago Sun-Times or the Sun- 
Times. All boxes of microfilm for the 
Sun-Times held in the UIUC newspaper 
collection were labeled as Chicago Sun- 
Times, without acknowledging the nu- 
merous title changes. 

After the alphabetization of each com- 
mon title within its city, newspapers were 
arranged chronologically. This organiza- 
tion system became the basis for evaluat- 
ing DDC to see how useful the classifica- 
tion scheme could be in organizing a 
newspaper collection. 

Evaluating DDC in Relation to a 
Newspaper Collection 

The evaluation uncovered both strengths 
and limitations. The main strength of the 
DDC to classify a newspaper collection is 
its geographic organization capability in 
applying Table 2: Geographic Areas, 
Periods, Persons. It is possible to build a 
number expressing country, state, region, 
and county. Figures 1^1 illustrate the ap- 
plication of Table 2. Another DDC 
strength is that all general newspapers 
can be found within a small range of num- 
bers. General newspapers from the 
United States are found in the 071 classi- 
fication number. In an open stacks collec- 
tion, this will facilitate browsing. 

One limitation was found, but it 
proved to be crucial in applying DDC in a 
strict manner. In most cases, geographic 
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Champaign daily news 

Merged with: Champaign daily gazette (Champaign, 111. : 1883) 
To form: Champaign news-gazette 

Vol. 1, no. 1 (July 29, 1895)-v. 25, no. 116 (Dec. 13, 1919) 

071 Journalism and newspapers in North America 

.7366 Journalism and newspapers in the United States 

Number from Table 2 representing Champaign County, 
Illinois (the first 7 is dropped according to the 
instructions in .3-. 9) 
C357 Cutter number representing Champaign (city of publication) 

cd Workmark taken from Champaign daily news 

Figure 1. Application of DDC, Modified Cutter and Workmark. 



arrangement in the scheme only allows 
access as far as the county level within the 
United States. Larger cities such as Chi- 
cago or New York have their own number 
in Table 2, but almost all average- to 
small-sized cities do not. Because of this, 
the city of publication would be elimi- 
nated from the organization of the collec- 
tion, and the final level of geographic 
arrangement for a U.S. newspaper would 
be the county. Patrons very rarely request 
newspapers by county, and frequently re- 
quest them by city. For example, a patron 
is more likely to ask for newspapers from 
Springfield, Illinois than from Sangamon 
County, Illinois. 

Traditionally, the cutter number is 
created based on the main entry of a cata- 
log record, and for newspaper records the 
main entry is always the tide, either in the 
form of the title proper or a uniform title. 
Using DDC in a strict manner and 
cuttering traditionally would make the ti- 
tle more important in the arrangement 
than the city. For example, if a strict and 
traditional method of applying DDC and 
cuttering were to be used to classify a 
newspaper collection, all newspapers 



from Cook County, Illinois outside of 
Chicago would be arranged by title with 
no city arrangement whatsoever. Hun- 
dreds of Chicago suburban newspapers 
would file alphabetically by title beneath 
the county arrangement. Skokie newspa- 
pers would be mixed alphabetically by ti- 
tle with Caiumet City papers and so forth. 
However, newspapers from Chicago 
would be separated from other papers 
from Cook County, because Chicago has 
its own number in Table 2. No city access 
is uniformly available through strict use of 
DDC and traditional cuttering methods. 

Modifying the Traditional 

After finding this major stumbling block, it 
became apparent that DDC had to be mod- 
ified to make it a sensible choice for classi- 
fying a newspaper collection. It was possi- 
ble to use certain elements in a strict 
manner, though. The subject number 071 
and the use of Table 2 were applied as in- 
structed in the DDC schedules. As stated 
above, this application generally only allows 
state, region, and county arrangement. 
The cuttering method was modified to 



Champaign daily gazette (Champaign, 111. : 1883) 
Merged with: Champaign daily news 
To form: Champaign news-gazette 

Vol. 1, no. 1 (Nov. 5, 1883)-<no designation (Dec. 11, 1919)> 



071 Journalism and newspapers in North America 

.7366 Journalism and newspapers in the United States 

Number from Table 2 representing Champaign County, 
Illinois (the first 7 is dropped according to the 
instructions in .3-. 9) 
C357 Cutter number representing Champaign (city of publication) 

cda Workmark taken from Champaign daily gazette 

The workmark "cda" is used because "cd" already repre- 
sents Champaign daily news cataloged prior to this title. 

Figure 2. Application of DDC, Modified Cutter and Workmark. 
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Champaign news-gazette 

Formed by the union of: Champaign daily news 

And: Champaign daily gazette (Champaign, 111. : 1883) 

Continued by: Champaign-Urbana news-gazette (Champaign, 111. : 1934) 

Vol. 25, no. 117 (Dec. 14, 1919)-v. 39, no. 305 (July, 27, 1934) 

071 Journalism and newspapers in North America 

.7366 Journalism and newspapers in the United States 

Number from Table 2 representing Champaign County, 
Illinois 

(the first 7 is dropped according to the instructions in 
.3-. 9) 

C357 Cutter number representing Champaign (city of publication) 
cd Workmark from previous title (Champaign daily news) 
1 Numeric extension used to keep title changes filing 
chronologically 

Figure 3. Application of DDC, Modified Cutter and Workmark, 



make certain that the city of publication 
was included in the classification arrange- 
ment. Choosing a cutter number based on 
the newspaper's place of publication in- 
stead of main entry (title for newspaper 
records) ensures that the city is a standard 
part of the classification number. This 
provides the city level of geographic 
arrangement that is so important to news- 
paper organization. With this modifica- 
tion, the classification number parallels 
the organization of the hierarchical place 
name added entry, which is a core ele- 
ment in newspaper cataloging. The first 
geographic area in this data element is 
country. The second area is state. The 
third area is county, and the last area is city. 

Generally, the cutter number is based 
on the place of publication found in the 
imprint. A newspaper can change places 
of publication within its lifetime, in which 
case it is up to the cataloger to decide 
which place of publication to use for the 
basis of the cutter number. For example, 



the title Times of Havana began publica- 
tion in Havana, Cuba, on February 4, 
1957, but by April 20, 1961, it had moved 
to Miami, Florida. The UIUC library 
holds issues beginning on April 20, 1961, 
but nothing before that date. It was logi- 
cal to cutter this title based on Miami, 
Florida, rather than Havana, Cuba, given 
that none of the library's holdings repre- 
sent the time period in which this title was 
published in Havana. 

The next element in the classification 
number remained the same as in a tradi- 
tional application. The workmark is as- 
signed from the title of the newspaper. 
This provides the alphabetical arrange- 
ment by title within the city of publication 
that is generally expected in newspaper 
organization. Because so many newspa- 
per titles begin with a city name (Cham- 
paign News-Gazette from Champaign, 
111.) or frequency (Chicago's Daily Inter 
Ocean), a guideline was established to 
keep workmarks from becoming too long: 



Champaign-Urbana news-gazette (Champaign, 111. .- 1934) 
Continues: Champaign news-gazette 

Vol. 39, no. 306 (July 29, 1934)-v. 41, no. 333 (July 18, 1936) 

071 Journalism and newspapers in North America 

.7366 Journalism and newspapers in the United States 

Number from Table 2 representing Champaign County, 
Illinois (the first 7 is dropped according to the 
instructions in .3-. 9) 
C357 Cutter number representing Champaign (city of publica- 

tion) 

cd Workmark from previous title (Champaign daily news) 
2 Numeric extension used to keep title changes filing 
chronologically 

Figure 4. Application of DDC, Modified Cutter and Workmark. 
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Generally, if a title begins with a city 
name or a frequency, we use the first let- 
ter of the first word then skip to the first 
letter of the second word. A workmark 
should not extend beyond three charac- 
ters, if possible. Figures 1-2 provide ex- 
amples of distinguishing workmarks. 

Because newspapers change titles fre- 
quently, the last element in the classifica- 
tion number was developed to handle title 
change situations and to make sure that 
newspapers remained in chronological or- 
der. When a title change occurs, a numeric 
extension is added to the classification 
number of the earlier title following the 
workmark. The first time a title changes, a 
"1" is added to the previous classification 
number. The next time the title changes, a 
"2" is added, and so forth. This keeps titles 
together in chronological order even when 
publishers decide to change the title on the 
newspaper masthead. 

In cases of mergers, absorptions, or 
other complex relationships, the designa- 
tion or numbering system can act as a 



pointer. For example, when two newspa- 
pers merge to form another newspaper, a 
cataloger has three possibilities for classi- 
fying the later title. An extension can be 
added to the classification number of the 
first merger title, an extension can be 
added to the classification number of the 
second merger title, or a completely new 
classification number can be created for 
the newly formed title. In this situation, 
the numbering system can offer clues. If 
one designation is continued by the later 
title, and the other designation dies with 
its title, then it is best to continue the clas- 
sification number corresponding to the ti- 
tle whose designation continues. Figures 
1-3 illustrate this concept., On the other 
hand, if neither designation from the 
merging titles continue and a new num- 
bering system emerges, it might be best to 
create a new classification number alto- 
gether based on the new title. Often these 
tough situations are best resolved based 
on the cataloger's judgment and what 
works best within a collection. 



Champaign daily news 
1895-1919 



Champaign daily gazette 
(Champaign, 111. : 1883) 
1883-1919 




Champaign news-gazette 
1919-1934 



071.7366 
C357cdl 
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news -gazette 


(Champaign, 111. 


: 1934) 
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Figure 5. Family Tree for the News-Gazette from Champaign, Illinois, 
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Illinois labor press 

From East St. Louis, Illinois 

331 Labor economics 

.8809 Historical, geographic, persons treatment of labor unions 
773 Number from Table 2 representing Illinois 
Ea7 9 Cutter number representing East St. Louis (city of 

publication) 

i Workmark taken from Xllinois labor press 

Figure 6. Application of DDC, Modified Cutter and Workmark to Genre Newspapers. 



Figure 5 shows a portion of the newspa- 
per family tree for the News-Gazette from 
Champaign, Illinois. Figures 1 through 4 
illustrate each title and its assigned classifi- 
cation number. Together they demon- 
strate the use of DDC, the modified 
cuttering method, workmark, and numeric 
extension to classify a newspaper set. 

Genre Newspapers 

Not all newspapers carry only general 
news and serve a geographic area. Some 
newspapers cover specific topics and rep- 
resent certain genres, such as religious 
newspapers or labor newspapers. In cases 
like this, the genre or subject orientation 
of the newspaper is often more important 
to researchers and should supercede the 
place Or publication as the primary focus 
of organization. It is best to classify genre 
newspapers within the appropriate sub- 
ject area from the DDC schedules. 

Geographic coverage is still important 
in most topical newspapers, but the sub- 
ject of the newspaper needs to be ex- 
pressed through the base classification 
number. Whenever possible, use Table 2 
to make sure that any geographic cover- 
age is also expressed. This provides geo- 
graphic organization to a genre newspa- 
per collection. Using the same modified 
cutter, workmark, and numeric extension 
guidelines that were developed for gen- 



eral newspapers will keep the classifica- 
tion method consistent. It will also pro- 
vide the basic geographic organization 
that a newspaper collection needs. Fig- 
ures 6 and 7 illustrate two genre titles and 
their assigned classification numbers. 

OBSERVATIONS OF IMPLEMENTATION 

After the INP began classifying the UIUC 
newspaper collection, the Newspaper Li- 
brary started shifting the microfilm col- 
lection into call number order. It would 
have taken much effort and resources to 
shift close to 75,000 reels of microfilm at 
one time, so the library staff decided to 
shift the newspaper microfilm reels for 
each state as INP finished cataloging 
them. This distributed the work effort of 
this formidable task and broke up the pro- 
cess into logical phases. Each state was or- 
ganized into call number order, but the 
shelving locations for each state remained 
in alphabetical order. It is possible that 
the collection can be later shifted into 
strict call number order. 

Because the Newspaper Library mi- 
crofilm collection is housed in a closed 
stacks area, staff and student employees 
must retrieve all requested film for 
patrons. Now that the U.S. collection is 
cataloged and classified, patrons are able 
to find titles and call numbers more 
independently than before. Patrons can 



Jewish world (Fairview Heights, 111.) 

296 Judaism 

•09 Historical, geographic, persons treatment 

77389 Number from Table 2 representing Saint Clair County, 
Illinois 

F168 Cutter number representing Fairview Heights (city of 

publication) 

j Workmark taken from Jewish world 

Figure 7. Application of DDC, Modified Cutter and Workmark to Genre Newspapers. 
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provide the Newspaper Library staff with 
a call number to expedite film retrieval. 
By relying on call numbers, there is not as 
much room for error in retrieval as when 
microfilm was completely arranged alpha- 
betically by geographic location. Staff, 
however, must know a title's state of publi- 
cation before the call number can be used 
for locating the film. This sometimes slows 
the retrieval process. Arranging a newspa- 
per collection completely by call number 
would alleviate this additional step. 

Patrons do not always understand the 
concept of a newspaper title change and 
provide the Newspaper Library stall' with 
a call number of a preceding or succeed- 
ing title to the title they want specifically. 
Staff and student employees know, how- 
ever, that the call number can provide an 
entry point to the shelves for that newspa- 
per family. They can rely on the extension 
numbers to move easily through title 
changes on the shelves without knowing 
the titles themselves. 

Staff members have noticed a differ- 
ence in the time it takes for student em- 
ployees to reshelve microfilm reels. Em- 
ployees do not have to pre-arrange 
microfilm geographically as much as be- 
fore, because there is a more systematic 
arrangement provided by call number or- 
ganization. There are also fewer incidents 
of microfilm being shelved incorrectly 
since the collection has been shifted into 
call number order, according to staff. Stu- 
dent employees are instructed to match 
the call number to the correct shelf loca- 
tion and then check the exact title against 
neighboring film reels. This provides a 
mechanism for accuracy when shelving 
microfilm. 

Conclusion 

Librarians facing the decision of whether 
or not to classify a newspaper collection 
must consider patrons' basic research 
needs. Users primarily access newspapers 
by geographic location, more specifically 
by city of publication, and by date. Geo- 



graphic organization was the primary fo- 
cus when evaluating DDC as the classifi- 
cation scheme to use for the UIUC 
newspaper collection. Applying DDC in a 
strict manner cannot express consistently 
the city of publication of a newspaper ti- 
tle, which is so important in meeting us- 
ers' needs. Modifying the traditional use 
of the cutter number can overcome 
DDC's limited geographic expression. 

Librarians also must consider that 
newspapers are serials and that classify- 
ing newspapers presents the same chal- 
lenges as classifying general serials. 
Multiple title changes and unique rela- 
tionships among newspapers can compli- 
cate the application of a classification 
scheme to a newspaper collection. 
Keeping newspaper families together 
chronologically on shelves is important 
when locating titles. It is crucial that pa- 
trons and staff have the ability to browse 
the shelves without knowing the intrica- 
cies of every title change that occurs in 
the lifespan of a newspaper family. The 
addition of a numeric extension to a base 
call number provides chronological order 
to the classification scheme. 

DDC, with a few modifications, can be 
used optimally to classify a newspaper 
collection. Knowing that classification can 
function as an organization method for a 
newspaper collection makes a library's de- 
cision of whether or not to classify easier. 
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Library Gateway: Project 
Design, Teams, and Cycle Time 

Karen Calhoun, Zsuzsa Koltay, and Edward Weissman 

Librarians at Cornell University Library (CUL) launched the first 
systemwide integrated gateway to networked resou roes, services, and library 
information in January 1998. The system was created and introduced in just 
17 weeks. The Library Gateway was launched in response to a confusing 
C UL online presence; the library was spending half a milium dollars a year 
on online resources and devoting a substantial amount of staff time to provide 
access to them, but systemwide efforts were only loosely coordinated. In this 
article, we examine the design and implementation process that helped the 
project succeed, rather thanfocusing on the technology or the vision behind 
the gateway. Examining the gateway project as a case study of new product 
development, we identify and discuss critical success factors, including: 
starting with a clearly defined concept, the importance of buy-in, simulta- 
neous development of different portions of the project, employing 
cross-functionalteams, andseeking continual feedback. Workingtogether on 
a project of this complexity and on such a fast time line was an important or- 
ganizational learning experience that moved CUL a st.ep closer to mastering 
the process of innovation. 



JL he Cornell University Library Gateway 
(http://campusgw.Iibrary.corneII.edu/) (fig- 
ure 1) represents the first integrated ap- 
proach to providing Wet-based informa- 
tion services to the Cornell community, and 
as such its introduction in January 1998 was 
an important step forward. Yet the process 
by which the gateway was implemented 
was equally important to the library, whose 
managers and staff continually are striving 
to deliver new services as quickly and 
cost-effectively as possible. It has become 
critical to know, from an organizational 
perspective, what enhances the ability to 
deliver new services in time, on time. 



In this paper, we examine the gate- 
way project as a case study of project de- 
sign, teams, and rapid cycle time. While 
the technology of the gateway might not 
be transferable to other environments, 
the cross-functional process employed 
to introduce it can serve as a model for 
others. We favor not merely reconciling, 
but tightly integrating the work of tech- 
nical service librarians, public service 
librarians, collection development libra- 
rians, and information technology spe- 
cialists to solve mission-critical problems 
and to ensure the continuing success of 
the library. 
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Figure 1. The Cornell University Library Gateway. 



We are in a period of turmoil. Elec- 
tronic resources have turned library prac- 
tices upside down. The tried-and-true 
practice of organizing libraries into fairly 
independent functional groups (collec- 
tion development, public services, tech- 
nical services, and information technol- 
ogy) no longer yields acceptable results. 
For example, Duranceau (1997) com- 
pares the workflows of print and elec- 
tronic, networked serials and concludes, 
"The print purchase process is a short, 
straight garden path. The digital world is 
cyclical . . . communication, coordination, 
and team effort is required at almost ev- 
ery stage . . . [and] the purchase process is 
a long, complex, winding dirt road filled 
with potholes." Schroeder (1997) de- 
scribes the potential benefits of blurring 
territorial lines between public and tech- 
nical service librarians. Martin (1996) 
emphasizes the importance of healing the 
schism between technical and public ser- 
vices, and argues that the electronic li- 
brary provides a tremendous opportunity 
for public and technical service librarians 
to begin to share leadership roles. 



Context of the Gateway Project 

In 1868 Ezra Cornell, a plainspoken in- 
ventor inspired by egalitarian ideals, 
founded "an institution where any person 
can find instruction in any study," includ- 
ing not only the arts and sciences, but also 
applied technology and agriculture 
(Cornell 1999). Today, the 19 libraries of 
the Cornell University Library (CUL) 
system serve 7 undergraduate colleges 
and schools, 4 graduate and professional 
units, and 2 medical units. The hallmarks 
of this complex library system, which em- 
ploys more than 500 people, are diversity, 
excellent collections, and a history of 
semiautonomy for unit libraries. 

Until the late 1980s, CUL had a largely 
captive audience for its print collections 
of more than 6,000,000 printed volumes 
and more than 60,000 journals. Since 
then, the Internet has altered dramati- 
cally library users' perceptions and pref- 
erences for obtaining information of all 
kinds. Based on the experience of many 
reference librarians and the findings of a 
campus survey, an important shift in atti- 
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tude has occurred: students tend to per- 
ceive the effort of using print resources as 
higher than finding the information online. 

In the present dynamic environment, 
librarians in CUL unit libraries have com- 
mitted themselves to keeping pace with 
technological innovations and with their 
users' preferences for finding informa- 
tion. By early 1997, the 19 CUL libraries 
had mounted numerous separate "offi- 
cial" library Web sites to provide Web or 
telnet access to more than 900 online re- 
sources. The library was spending half a 
million dollars a year on online resources, 
and staff devoted a substantial amount of 
time to providing access to them, but sys- 
tem-wide efforts were only loosely coor- 
dinated. 

Organizationally, CUL staff have 
tended to interpret their roles and re- 
sponsibilities in two dimensions: (1) with 
respect to their function (collection de- 
velopment, technical services, public ser- 
vices, information technology) and (2) 
with respect to the library unit for which 
they work. The conventional framework 
for accomplishing large systemwide pro- 
jects, such as the selection of a new library 
management system, has been to orga- 
nize functional committees, with repre- 
sentatives from various library units, to 
look after each function's and unit's needs. 
Coordination and conflict management 
across functions and units have tended to 
occur at the administrative level. 

The Problem 

The result of the technical, service, and 
organizational environment was a confus- 
ing CUL online presence. Networked da- 
tabases, numeric files, and full-text re- 
sources were available but often difficult 
for users to find. The library was provid- 
ing three different approaches to these 
materials. 

The first of these was Bear Access, a 
site based on Mandarin technology devel- 
oped by Cornell Information Technol- 
ogies, which presented a set of "launch 
pads" and "buttons." The buttons repre- 
sented both specific networked resources 
(which opened specific sessions or re- 
sources), and additional compilations of 



resources — either more launch pads and 
buttons or Web pages that provided links 
to specific networked resources. To dis- 
cover and access a particular resource, 
however, users needed to know either the 
genre (e.g., abstract, index, electronic 
journal, catalog) or where in the hierarchy 
the specific pointer to the resource ex- 
isted. Library staff in the unit libraries 
supporting the humanities, social sci- 
ences, and area studies tended to rely 
most on the Bear Access model for con- 
necting users to online resources. 

The second approach was the Mann 
Library Gateway, which was developed 
by the staff at the Albert R. M ann Library, 
the unit library that supports agriculture, 
biology, nutrition, human ecology, and re- 
lated fields. This gateway began as a 
text-based product but evolved into a 
Web-based product. Its initial goal was to 
provide a search engine that allowed us- 
ers to key in either the name of a resource 
or keyword terms in order to retrieve de- 
scriptions of matching networked re- 
sources. Connections to the resources 
were embedded in the descriptions. 

Finally, unit library Web pages gave 
several unit libraries strong Web pres- 
ences of their own, among them the hotel 
management school, the industrial and la- 
bor relations school, and the graduate 
school of business. Here again, to find and 
use an online resource, the user needed 
to know something about the cubbyhole 
in which the resource was located. 

Importantly, none of these approaches 
encompassed all of CUL's networked re- 
sources. Users at Mann Library were pre- 
sented with the Mann Gateway on the 
public access workstations. Users at most 
of the other libraries were presented with 
Bear Access menus. Although the library 
staff realized that a unified approach — a 
single point of entry to CUL's networked 
resources, or a "common entryway" — was 
needed, there was no consensus on the 
best way to achieve it. 

There was at least one point of agree- 
ment however. With the rapid and contin- 
uing growth of networked resources, ev- 
eryone felt that users should have the 
option to search as well as browse for net- 
worked resources. Library managers and 
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staff therefore desired a catalog that in- 
corporated records for networked re- 
sources with embedded Uniform Re- 
source Locators (URLs), but they 
supported two different options. One was 
to adapt the Mann Gateway technology 
into a CUL-wide service. The second was 
to develop a robust and flexible Web front 
end to the CUL online catalog while at the 
same time creating the capability of 
searching that slice of the library catalog 
that represented networked resources. 

In June 1997, the library administration 
appointed a small committee — the Com- 
mon Entryway Committee — to review the 
two common entryway models; evaluate 
the functions, features, and implementa- 
tion details for each option; and to recom- 
mend a model for CUL. By the end of July 
1997, with the assistance of a number of 
library staff from across CUL, the commit- 
tee submitted its recommendation to the 
library administration for the adoption of 
the Mann Gateway technology and the cre- 
ation of a CUL Library Gateway. 

The Solution 

In late summer 1997, the library adminis- 
tration accepted the recommendations of 
the Common Entryway Committee and 
formed three committees to build the 
Library Gateway over the course of the 
tall semester. In addition, two program- 
mers — a specialist in Hypertext Markup 
Language (HTML), and a specialist in da- 
tabase work, CGI scripts, Javascripts, con- 
nection scripts, and staff interfaces — were 
assigned fofl timeto the project. 

A Public Services Design Committee 
was charged with designing the CUL 
common entryway and associated pages. 
A Technical Services Implementation 
Committee was formed to provide an ap- 
proach for browsing networked resources 
by subject, to develop procedures for gate- 
way record creation and maintenance, and 
to prepare the Library Gateway database. 
To coordinate the work of the two imple- 
mentation committees and provide pro- 
ject oversight, the Library Gateway 
Steering Committee was appointed. 

The steering committee was a cross- 
functional group that included the chairs 



of the two implementation committees, 
the heads of two information technology 
departments in the library (the Library 
Technology Department, which served 
many of the unit libraries, and Mann Li- 
brary's Information Technology Section), 
the associate university librarian for in- 
formation technology, and the head of the 
reference department in the largest CUL 
unit library. While the contributions of all 
committee members were important to 
the process, the core project team con- 
sisted of the chair of the steering commit- 
tee, the chairs of the two implementation 
committees, and the two programmers 
assigned to the project. 

The project was intended to accom- 
plish five objectives: 

• To create a unified, identifiable online 
presence for CUL 

• To create a common entryway to 
CUL's networked resources and ser- 
vices 

• To develop and elicit the use of a com- 
mon set of processes and procedures 
for cataloging networked resources 

• To develop and elicit the use of a com- 
mon set of processes and procedures 
for user training and support 

• To improve coordination between 
CUL's two information technology de- 
partments 

The first meeting of the steering commit- 
tee was in early September 1997. The 
project team adopted an extremely ambi- 
tious implementation time frame, and 
committed themselves to bringing up the 
Library Gateway in 17 weeks (that is, by 
early 1998). All of the subgroups worked 
in parallel, with the steering committee 
providing oversight. The milestones are 
given in table 1. 

The team introduced the Library 
Gateway on schedule on January 5, 1998, 
several weeks before the start of the sec- 
ond semester of the academic year. 

Project Costs 

Library Gateway development from the 
start of the fall semester to the launch re- 
quired about 2,900 hours of staff time, 
which are listed in table 2. 

The gateway had a strong impact on li- 
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TABLE 1 
Gateway Project Milestones 



Snben 



Milestones 



Information Technology 



Technical Services Implementation 
Committee 



Public Services Design Committee 



Steering Committee 



Determined hardware, software, and manpower 
requirements; treated a Web interface to the MySQL, 
database, which supports searching and display of CUL 
networked resources; installed dieTiardware; produced 
software documentation; loaded the Gateway software 
onto the server; trained technical support staff; and 
worked wiLh the two implementation committees to 
create the necessary Web pages, CGI scripts, PERL 
scripts, a staff interface to support database creation and 
maintenance, and other developments. 

Developed a scheme for browsing Gateway resources by 
subject; prepared guidelines for catalog! rig Gateway 
resources and trained CUL eatalogers to use them; 
worked with the information technology staff to test the 
staff interface to the Gateway; and created catalog 
records for the 1,100 Gateway resources. 

Decided on the specific services to be supported on the 
Gateway and created the design for the screens; 
implemented die Web pages and designed the pages that 
are generated by the PERL scripts; and worked closely 
with" the information technology staff to define the final 
look and feel of the Gateway; wrote the content for help 
and informational pages. 

Prepared recommendations with regard to Cornell and 
CUL systems, hardware and software that would be 
affected by the Gateway (e.g.. Bear Access): gave 
numerous presentations on the Gateway to solicit 
feedback from CUL functional groups; oversaw the 
incorporation of staff feedback into the Gateway design; 
drew up plans for public services staff orientation to the 
Gateway; and completed a plan for ongoing Gateway 
maintenance and enhancement. 



brary staff, who had to support the new 
system, which was not a trivial addition to 
their usual tasks. Training library staff 
meant involving most of the organization 
in maintaining the gateway on some level. 
Prior to the launch, a task force for gate- 
way training and user support was 
formed, and the chair became a member 
of the steering committee. This group put 
in place a structure and process for han- 
dling questions that arrived at reference 
desks and via e-mail using the "Reference 
Question?", "Problem Connecting?", and 
"Comments about the Gateway?" links at 
the bottom of most gateway pages. The 
task force also designed and carried out 
numerous library staff and user training 
sessions. This group's work had the dual 
benefits of preparing staff and users for 
the gateway and of widening the circle of 
stakeholders in the gateway's future. 



The Process: What Was Learned 

To maintain the library's attractiveness to 
readers and its centrality to the university, 
staff at CUL have committed themselves 
to offering efficient, quality access to on- 
line resources of interest to their users . To 
that end, a surprising new organizational 
competency is required: CUL must be 
the sort of organization that can design 
and bring up new online systems and ser- 
vices swiftly and effectively Further, 
CUL is faced with mastering the process 
of innovation and new product develop- 
ment in general — not unlike organiza- 
tions in the commercial sector. 

Researchers in the area of new prod- 
uct management (see, for example, 
Crawford 1997; Cooper 1993; Cooper 
and Kleinschmidt 1993) suggest that the 
ability to innovate and deliver a steady 



LRTS • 43(2) • Notes on Operations /119 



TABLE 2 
Gateway Project Costs 



Group Staf f Hours (Estimate) 

Information Technology Staff 1,350 
Technical Services Implementation Committee 650 
Public Services Design Committee 625 
Steering Committee 275 
Total 2,900 



stream of new products is essential to an 
organization's growth and long-term via- 
bility. Along the same lines, reduced cycle 
time — speed to market — appears to be 
central to an organization's continued 
success in its market. The experience with 
the Library Gateway project indicates 
that there may be some key ideas to suc- 
cess for introducing new products in 
large, organizationally complex organiza- 
tions such as CUL. 

The first key idea is to start with a 
well-developed, clearly defined concept 
of the new product (which could be a new 
system or service, like the gateway). Make 
sure that management and those who 
must implement the new product under- 
stand and buy into the concept. The Li- 
brary Gateway group was fortunate to be- 
gin its work with a well-developed 
prototype based on the Mann Library 
Gateway. And, thanks to decisions by the 
library administration in response to the 
work of the Common Entryway Commit- 
tee during the summer of 1997, the im- 
plementation groups knew they had the 
unqualified support of CUL's leadership. 

The gateway prototype and its later it- 
erations were valuable communication 
tools and guides for various individuals 
and functional groups in the CUL system 
that needed to buy into the product con- 
cept. As a result, everyone involved was 
reasonably clear about what the Library 
Gateway would be from the beginning of 
the project. Without such sharp, early 
definition of the gateway, the implemen- 
tation groups might have faced vague or 
moving targets, or failed to gain the nec- 
essary acceptance, and the project might 
have taken too long — or worse, have not 
been completed at all. 



The second key idea is to undertake a 
new product development project in a set 
of simultaneous, overlapping activities, 
rather than developing in phased, se- 
quential stages. After the selection of a 
model for the Library Gateway, all of the 
implementation groups were formed at 
once and charged with various aspects of 
the work. Interface design occurred con- 
currently with gateway database develop- 
ment, technical development, staff train- 
ing, and marketing (in the sense of 
gaining awareness and support from 
those who would be affected by the gate- 
way). The process might have looked cha- 
otic from the outside but it was not. It re- 
quired a great deal of cross-functional 
communication, tolerance for ambiguity, 
management support, and project team 
commitment to the project, but the con- 
current operations enabled the imple- 
mentation groups to deliver the gateway 
to library users in only 17 weeks. 

The third key idea is to break down or- 
ganizational barriers by using cross- 
functional teams that are interdependent 
and accountable for the project. The 
Library Gateway Steering Committee 
was a cross-functional group that in- 
cluded administrators and staff from vari- 
ous functional areas of the library system 
(information technology, public services, 
and technical services). The chairs of the 
two implementation subcommittees 
(which were not cross-functional in 
nature) were part of the steering commit- 
tee. The steering committee was effective 
because members were committed to the 
project; diverse ideas and open communi- 
cation were encouraged, several team 
members shared initiative, and team re- 
sources were identified and used to good 
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purpose. Without this cross-functional 
structure, the project could have experi- 
enced lengthy delays and foul-ups due to 
the inherent difficulties of passing off re- 
sponsibility in sequential phases from one 
compartmentalized functional group to 
the next. 

The final key idea is to seek continual 
feedback from stakeholders, be respon- 
sive to their concerns, and build organiza- 
tional consensus as the project pro- 
gresses . The gateway group did its work in 
the open. While speed was one of the re- 
quirements of the project, the team did 
not allow the gateway project to gather 
uncontrolled momentum. In other words, 
as the gateway was developed, team 
members proactively identified stake- 
holders — people who needed to know 
about the project, people who would have 
to support the gateway once imple- 
mented, and people whose approval was 
needed — and organized demonstration 
sessions to gather their feedback and ad- 
vice, uncover and deal with resistance, 
and gain stakeholders' acceptance of the 
system. Several times during the course of 
the project, developers were able to pro- 
totype changes to the system within a day 
or two of receiving a suggestion in one of 
the demonstration sessions. This respon- 
siveness did a great deal to enhance the 
project team's credibility and build confi- 
dence in the product. 

There is evidence that cross- 
functional teams can produce dramatic 
benefits, such as rapid project comple- 
tion, more innovative services, better 
quality, and improvements in productiv- 
ity. Unfortunately, simply creating a 
team and promoting teamwork does not 
appear to be suff icient to reap these ben- 
efits. For true teamwork to materialize, 
the team should be the right size, its 
members must be committed to (and ac- 
countable for) the project, and the team 
must have sufficient autonomy and ade- 
quate resources to accomplish its work. 
Further, there is evidence that effective 
teams rely on shared decision making, 
consensus building, and frequent com- 
munications with stakeholders to man- 
age organizational boundaries and over- 
come obstacles (see, for example, 



Ranney and Deck 1995), 

Except as stakeholders, CUL collec- 
tion development and acquisitions staff 
did not participate in the design and 
implementation of the gateway. This 
turned out to be a weakness in the pro- 
ject design. Toward the end of the 
project, the steering committee con- 
ducted a post-project evaluation, and 
recommended the creation of a cross- 
functional Electronic Resources Com- 
mittee with responsibility for policy and 
procedural decisions associated with se- 
lecting and incorporating new net- 
worked resources into the gateway. 

Ry summer 1998, the initial steering 
committee, the two implementation com- 
mittees, the task force on training and 
user support, and the rest of the project 
team had disbanded. The steering com- 
mittee's final action was to produce a 
"mainstreaming document" in which, 
among other things, it was recommended 
that a successor group be appointed — the 
CUL Gateway Committee — that comple- 
ments the work of the Electronic Re- 
sources Committee. The Gateway Com- 
mittee is made up of a coordinator, the 
gateway Web editor, the editor of the 
guidelines for cataloging networked re- 
sources, the associate university librarian 
for library information technology, a tech- 
nical specialist, a public services librarian, 
and the chair of the Electronic Resources 
Committee. The CUL Gateway Commit- 
tee's charge is to ensure that the gateway 
continues to evolve to best serve the 
needs of CUL and its users. 

Benefits of the Gateway 
Benefits for Users 

With the creation of the gateway, Cornell 
University built a virtual branch library 
on the Internet. By pulling all network ac- 
cessible resources, services, and library 
information together into a single inter- 
face, CUL created an all-inclusive Web 
presence. Users have a single system to go 
to instead of having to know ahead of time 
what they need and where they can find it. 
Using the library has become easier, and 
networked resources have become more 
visible. At the same time, the gateway 
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supports and points to the Web sites of 
unit libraries, thus highlighting and tak- 
ing advantage of the talent, initiative, and 
diversity of these libraries. 

The gateway was released in early Jan- 
uary 1998, in the middle of the academic 
year. To make the transition to the new 
system easier the project team decided to 
maintain the old-style access for a semes- 
ter, in tandem with the gateway. Despite 
this dual environment, the gateway re- 
ceived heavy use all over campus. In Janu- 
ary 1998, there were 12,106 connections; 
one month later, total connections had 
jumped to 26,610. In September 1998, to- 
tal connections were 48,105. 

Another major benefit is the fact that 
the gateway has made remote use of CUL 
resources possible. Because of the au- 
thentication and authorization module of 
the system, it is not necessary to limit ac- 
cess to most of the CUL licensed data- 
bases to Cornell-based computers only. 
Now there is a way to screen users coming 
from non-Cornell machines and deter- 
mine who is authorized to have access. 
This is a major benefit for travelling 
Cornellians. 

Organizational Benefits for CUL 

Working on the implementation of the 
gateway was a shared project across the 
unit libraries and across functional 
groups. Working together on a common 
project of this complexity and on such a 
short timeline was a very important step 
towards creating greater understanding 
and cooperation between functional 
groups and among the unit libraries. The 
process affected not only the key players; 
along the way the gateway project team 
had to communicate with the wider li- 
brary community regularly and build its 
feedback and ideas into the system. The 
openness and responsiveness of the pro- 
cess helped build trust and acceptance 
for the gateway in library staff, some of 
whom were quite reluctant and even re- 
sentful of the change at the beginning of 
the project. 

Even after the implementation, the 
gateway project continues to foster dialog 
and cooperation. The maintenance of the 



system requires constant input from and 
dialog between different functional units 
of the library (collection development, 
acquisitions, cataloging, public services, 
and information technology) and be- 
tween the main and unit libraries. Open- 
ness to suggested improvements and crit- 
icism continues to be important. 

The library's investment in its online 
resources is half a million dollars a year. 
Making these resources more readily ac- 
cessible and more visible maximizes the 
benefits of this substantial investment. 

The Future 

The library administration has funded a 
series of focus grojigs to provide CUL 
managers and sSff with more feedback 
on the system and more general infor- 
mation about what features the users 
value and need. The results of this study 
should prove useful for the develop- 
ment of future generations of the gate- 
way as well. 

Summary 

While the Library Gateway is an impor- 
tant technological achievement, what 
might be most helpful to other institu- 
tions is an understanding of the process 
used to build it. First, the experience with 
using teams was an excellent one. The 
cross-functional steering group and its 
two subgroups were able to cut across tra- 
ditional organizational boundaries, to 
work and respond quickly, and to build 
consensus continuously. Second, but 
equally important, was the library man- 
agement's support. By providing a clear 
mandate and strong sponsorship while 
still giving the team sufficient autonomy, 
CUL's leaders established the conditions 
in which innovation, teamwork, and ex- 
cellent service could emerge. Third, the 
critical importance of involving all func- 
tional groups in the task of organizing 
electronic resources in libraries was dem- 
onstrated. In the face of the dramatic shift 
brought about by the Web, it is essential 
to integrate library functions effectively, 
and to make the most of what each group 
knows and does best. 
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The DDC, the Universe of Knowl- 
edge, and the Post-Modem Library. 

Francis L. Miksa. Albany, N.Y.: Forest 
Press, 1998. 99p. $5. (ISBN 0-910608- 
64-4). LC97-41719. 

Why is the DDC (Dewey Decimal 
Classification) structured the way it is? 
What is its relationship to theories of clas- 
sification of knowledge and of library clas- 
sification? How has it evolved over the 
years, and what are its prospects as we en- 
ter the age of the "post-modern library"? 
These are some of the questions Francis 
Miksa discusses in this interesting study, 
an expansion of his DDC Anniversary Lec- 
ture at the Fourth International ISKO (In- 
ternational Society for Knowledge Organi- 
zation) Conference in July 1996. 

The DDC is one of the most successf ul 
library classification systems in the world, 
as evidenced by its widespread use in li- 
braries in die United States and in many 
other countries and by its growth and evo- 
lution over more than 120 years. But the 
history of the DDC has not followed a 
straight line. Its development has been af- 
fected by the personal beliefs of founder 
Melvil Dewey and of the various editors 
who came after him, by changing theories 
of library classification, and by practical 
considerations. Numerous controversies 
have arisen along the way; for example, the 
radical changes in the fifteenth edition in 
1951 are blamed in part for the move by 
many academic libraries from the DDC to 
the Library of Congress Classification. 

The DDC was a product of its age. 
Dewey developed it in the 1870s, when 
the larger scientific and philosophical 
community was greatly interested in the 
classification of all knowledge. However, 
library classification always has had a 



practical aspect — the DDC is not just a 
classification of knowledge; it is used to 
classify actual books in libraries of various 
sizes and types. There always have been 
conflicts over how many levels of hierar- 
chy to include in the Dewey classification 
scheme and over how much it should be 
revised to keep up with changing knowl- 
edge, given the need for integrity of class 
numbers to reduce the work of reclassify- 
ing books in libraries. 

In 1899, the Library of Congress 
wanted to adopt the DDC, but with fun- 
damental changes that Dewey refused. 
LC then went on to develop its own very 
different classification scheme. Also at 
the turn of the century, the new Institut 
International de Bibliographie in 
Brussels set out to catalog and classify sci- 
entific literature. Paul Otlet and Henri 
LaFontaine began by expanding on the 
DDC, but the specialized nature of the 
materials with which they were dealing 
and the particular needs of the scientists 
and scholars they served led to the devel- 
opment of the Universal Decimal Classi- 
fication (UDC). Their work led to a new 
idea of subjects, one that was much more 
complex and more difficult to contain in a 
strict hierarchy. Miksa also details the 
contributions of Ernest Cushing Richard- 
son, Henry Evelyn Bliss, W.C. Berwick 
Sayers, and S.R. Ranganathan to library 
classification theory in the early parts of 
the twentieth century, when the old view 
of the universe of knowledge as a 
one-dimensional, hierarchical structure 
was replaced by a more complex, modu- 
lar, and faceted view. 

Since the 1950s, these theoretical 
developments have had an impact on the 
DDC in its major efforts to keep the ter- 
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minology up-to-date and to improve sub- 
ject collocation, as well as in increasing 
use of subject faceting and notational syn- 
thesis. However, Miksa finds that the 
DDC is still based on three assumptions 
that, if unexamined, could have a negative 
effect on its future and on the future of li- 
brary classification in general. These as- 
sumptions are: "knowledge categories are 
by nature hierarchical and logical; one 
best classification system is achievable; 
and document retrieval is the main pur- 
pose of library classification" (p. 82). 

Only in his conclusion does Miksa con- 
sider the "post-modern library," by which 
he means both an evolving concept of the 
library as a personal space, made possible 
by electronic information and modern tele- 
communications, and the library as a prod- 
uct of the post-modern age, in which 
knowledge and truth are no longer absolute 
and everything is relative. The best hope for 
the survival and continued relevance of the 
DDC in this age lies in a thorough re- 
conception of the system, including vari- 
able levels of specification and alternative 
arrangements for collocating information. 

This slim book will interest readers who 
are curious about the history of libraries 
and the classification of knowledge. Its rel- 
evance is not limited to libraries using the 
DDC or to classifiers. It is very well orga- 
nized, though it may provide a little too 
much detail on the history of the various 
editions of the DDC and not enough in- 
sight into possible future developments. A 
nine-page reference list is included. 
— John Hostage (hostage© law. harvard, 
edu), Authorities Librarian, Harvard Law 
School Library, Cambridge, Mass. 

Communicating Research. A. J. 

Meadows. San Diego, Calif.: Academic 
Press, 1998. 266p. $59.95. (ISBN 
0-12-487415-0). LC97-23432. 

In Communicating Research, A. J. 
Meadows begins by stating that "Commu- 
nication lies at the heart of research . . . for 
research cannot properly claim that name 
until it has been scrutinized and accepted 
by colleagues" (p. ix). This is obvious, but 
perhaps too easily taken for granted. 
Throughout this book, Meadows reiter- 
ates that understanding results from 



communication — from the interaction 
between the researcher and the receiver 
of research information. Research com- 
munication continues to evolve, most re- 
cently with the introduction of electronic 
networks — Meadows summarizes his 
main theme as "change and diversity" (p. 
239). In his view, changes in the research 
communication process are not driven 
solely by technology, but also by the 
needs of the research community. He 
recognizes the effects of the technology, 
but considers it in its proper place as a 
tool of the researcher. 

This book provides a good overview of 
the history and evolution of research com- 
munication, with a concise timeline that 
proceeds from Aristotle's symposia to the 
invention of printing, the development of 
postal systems, and the rise of the modem 
research journal as learned societies found 
meetings, personal correspondence, and 
books inadequate to keep a growing audi- 
ence abreast of expanding research. 

The emphasis in Communicating Re- 
search is on academic research, though 
private industry and government-funded 
research are included as a basis for com- 
parison. Meadows focuses mainly on re- 
search in the natural sciences and in- 
cludes humanities and social sciences 
research when he touches on the evolu- 
tion of these distinct fields and the diffi- 
culty in defining the boundaries between 
them. A general characterization of the 
differences between the humanities and 
social sciences versus the natural sciences 
is that in the former, the book is more 
widely used as a tool of communication, 
while the journal article is most common 
in the latter. In addition, the emphasis in 
the natural sciences is almost always on 
the most current research, while older 
knowledge is read and cited more often in 
the humanities and social sciences. The 
structure of the journal article contrib- 
utes to the efficiency of the communica- 
tion process by providing the expected 
title, author, date of receipt (essential for 
establishing "first" discoveries, especially 
in the natural sciences), abstract, body 
(usually introduction, methodology, re- 
sults, conclusion), and list of references. 
Books also have a typical layout, with an 
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index to assist researchers in locating and 
absorbing information quickly. 

The "professionalization" (p. 24) of re- 
search began in German universities, was 
well-established by the second half of the 
nineteenth century, and then spread to 
the rest of academia and eventually to 
industry and government. The size and 
education level of the research commu- 
nity began to grow, along with the amount 
of information it generated. Meadows 
makes the interesting point that the feel- 
ing of being overloaded with information 
actually has been common for a long time. 
To cope with the expansion of knowledge, 
universities were organized into depart- 
ments and new disciplines were created 
through specialization or fusion. 

The research community is subject to 
a set of social norms that are reflected in 
the communication process. When these 
norms are followed, they are invisible, but 
when they are not, plagiarism and the 
forging of research data may result. The 
community typically experiences periods 
of "normal science" interspersed with pe- 
riods of "revolution," all documented 
through the recognized communication 
channels that record the cumulative 
knowledge of a discipline. Meadows dis- 
cusses various aspects of research, includ- 
ing who performs research and why, the 
lower visibility of women researchers and 
those from developing countries, the ef- 
fect of age on the researcher's output, the 
involvement of multiple authors in writ- 
ing results, citation studies, and the roles 
of editors, referees, the mass media, and 
even amateur researchers. He highlights 
current trends, such as the increasing em- 
phasis upon general, theoretical research 
as opposed to specific, empirical research 
(though this varies by discipline) and the 
increase in collaborative and interdisci- 
plinary efforts. 

The effects of technology on research 
communication are stressed throughout 
this book. Meadows points out that tech- 
nological manipulation of bibliographic 
data helps researchers locate informa- 
tion. At first, only bibliographic citations 
were searchable by computer, then ab- 
stracts, and now full-text. He credits in- 
formation technology with opening up 



new research opportunities and regular 
communication outside the research 
hierarchy, making the research process 
more democratic and open. Electronic 
networking allows wide dissemination of 
research in progress. On the other hand, 
these newer, more flexible channels bring 
a number of challenges: lines are blurred 
between formal and informal exchanges 
(when is something "pre-" and when is it a 
"print"?), questionable material can go 
undetected by the novice reader, data 
tampering can occur, and "electro- 
copying" can jeopardize copyright pro- 
tection. There are citation difficulties, 
uncertainties regarding long-term archiv- 
ing, and a continuing reluctance to accept 
electronic publishing within the aca- 
demic community. Meadows endorses an 
ideal scenario in which print and elec- 
tronic publication are parallel. 

Though it is not the major focus of this 
work, Meadows recognizes the role of 
publishers and libraries in the research 
process. Publishers work with authors to 
assure that information is suitable for pub- 
lication, that it is produced in an appropri- 
ate, legible format, and that it is 
well-disseminated. Libraries are the pri- 
mary purchasers of scholarly publications; 
their selection decisions affect the actions 
of both publishers and readers, and these 
decisions are in turn often driven by the 
immediate needs of the institutions own 
researchers. The library's role is to archive 
the records of research and to make them 
available, a role that brings with it dilem- 
mas regarding the financial inability to 
keep pace with publishing and the rising 
cost of physical storage. Both libraries and 
publishers face new challenges with elec- 
tronic publishing, though Meadows be- 
lieves that libraries have the most difficult 
task — storing and providing access to both 
the old and the new. 

Electronic journals gained popularity 
first in the humanities and social 
sciences — articles are composed mainly 
of text, and publishing space was needed 
that was unavailable in print. The natural 
sciences are catching up now that tech- 
nology can support the graphics required 
for publications in this field. Researchers 
at present recognize that a mix of print 
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and electronic documents is the reality of 
the mid-1990s, but Meadows advocates 
more retrospective conversion to elec- 
tronic format in order to expedite re- 
searchers' work. Paradoxically, informa- 
tion technology has made the researcher's 
task more difficult by increasing the 
amount of information, while at the same 
time making it more convenient for each 
researcher to access that information from 
a personal workstation. Meadows cautions 
that these electronic channels may further 
divide the information-rich from the infor- 
mation-poor, based upon the degree of ac- 
cess to information technology. 

Many of the author's points are illus- 
trated by tables with data on everything 
from "A Comparison of the Number of 
Articles Devoted to Astronomy/Space 
and Medicine at Two Epochs" (p. 73) to 
"The Existence of Bias in Refereeing 
Judgments" (p. 190). The concepts that 
Meadows presents are so interconnected 
and so often repeated that 1 found myself 
pondering a better organization of this 
work while I read it. Yet in the end, I real- 
ized that this is the point: the communica- 
tion of research is a complex, chaotic pro- 
cess that is constantly changing. The 
choice of the term "Postscript" for what I 
first considered a conclusion or summary 
seemed odd, but it became clear from the 
closing sentence that the author's clever 
play on the word "post-script" (p. 242) was 
intentional. Meadows correctly observes 
the myriad questions raised by the elec- 
tronic channels of communication now 
available to researchers, although he un- 
fortunately does not seem to have any 
greater insight than the rest of us into how 
it will all turn out. — Ellen McGrath 
(emcgrath@acsu.hujfalo.edu), Head of 
Cataloging, Charles B. Sears Law Library, 
State University of New York at Buffalo 

Budgeting for Information Access: 
Managing the Resource Budget for 
Absolute Access. Eds. Murray S. 
Martin and Milton T. Wolf. Chicago: 
American Library Association, 1998. 
174p. $30 (ISBN 0-8389-0691-5). 
LC98-10964. 

In Budgetingfor Information Access, 



Martin and Wolf define access very 
broadly as "finding, looking at, or using 
any printed or electronic information" (p. 
20). All types and formats of accessible 
collection-related materials, from print to 
electronic media, are discussed and ex- 
amined in a series of short chapters. 
These include materials that are directly 
retrievable on-site in traditional and 
electronic formats and those received 
through interlibrary loan or document 
delivery services. Both editors are associ- 
ated with Technicalities, a library journal 
and forum that represents the practitio- 
ner's point of view. Martin, who died in 
April 1998, was the author of the 
"Money Matters" column in Technical- 
ities and many articles and books on 
budgets and library collections. Milton 
T. Wolf is the f ounding editor of Techni- 
calities and is currently vice president 
for collection programs at the Center for 
Research Libraries. 

Most of the chapters in Budgetingfor 
Information Access contain practical infor- 
mation about collection-related concepts 
such as preservation, pricing, collection or- 
ganization and maintenance, and budget 
implications. The "Notes" section in each 
chapter and the selective bibliography at 
the end of the book contain current and 
relevant source material, although the 
mention of some of these publications and 
studies is repeated too often throughout 
the text, as are material and charts from sev- 
eral of Martin's previous works. 

The strength of this book lies in its dis- 
cussion of budgeting and of the wide 
range of information alternatives avail- 
able to libraries in the current market- 
place. These alternatives include other 
collection-related areas — beyond print 
and electronic access — such as document 
delivery, interlibrary loan, resource shar- 
ing, and consortial and contractual rela- 
tionships. This is important reading for 
acquisitions librarians whose functions 
overlap increasingly with areas outside 
their traditional domain, i.e., borrowing, 
leasing, and procurement for individuals, 
rather than purchasing materials to be 
cataloged and processed for the library 
collection. The thoughtful sections on de- 
veloping a resource budget and on budget 
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scenarios cover an important and often 
overlooked area of library management. 

Budgeting for Information Access is 
less successful in its discussion of elec- 
tronic resources. Problems associated 
with ownership and preservation in the 
electronic arena, as well as the economic 
consequences of acquiring and licensing 
these materials, are all important issues 
that many librarians confront. This book's 
heavy-handed bias toward print materials 
can be justified in some circumstances; 
nevertheless, statements that compare 
and contrast print and electronic re- 
sources often are confusing and overly 
dramatic as indicated in the following 
statements: "Of particular importance is 
the difference between accessing a book 
and accessing electronic media . . . be- 
cause the first is a simple matter of pick- 
ing up something that has already been 
purchased and is therefore subject to the 
Doctrine of First Sale, while the second is 
like accessing a datafile that is not in the 
public domain"(p. 119) and "Unless pro- 
viders and users can find more common 
ground, the unlimited promise of the new 
medium may diminish or vanish" (p. 88). 
In addition to commercially available 
products, which are emphasized in the 
text, there are thousands of govern- 
ment-maintained, content-rich Internet 
sites, accessible from the home or library, 
which provide information directly to the 
public. Electronic resources vary greatly in 
format, access, usage, content, and quality; 
in this text, their distinctions are some- 
times neglected in favor of broad editorial 
statements on electronic resources in gen- 
eral. Peggy Johnson (1998) and Ross 
Atkinson (1998) deal with these issues 
more effectively in an effort to inform and 
improve library practice and theory. Dis- 
cussions of copyright and contract licens- 
ing are scattered throughout the text; indi- 
vidual sections devoted to these issues, 
with specific guidelines, wouldbe helpful. 

Over the last decade, in an effort to 
meet users' information and research de- 
mands, libraries have attempted, within 
budgetary constraints, to provide access 
to a wide range of services and materials. 
Martin and Wolf's book is a good starting 
point for a better understanding of "ac- 



cess" in an ever-changing library environ- 
ment. — Amy Dykeman (amy _dykeman@ 
solinet.edu), Library Products and Ser- 
vice Manager, Electronic Information 
Services, SOLINET, Atlanta, Ga. 
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Practical Digital Libraries: Books, 
Bytes, and Bucks. Michael Lesk. San 
Francisco: Morgan Kaufman, 1997. 
297p. $49.95. (ISBN 1-55860-459-6). 
LC97-22069. 

Michael Lesk is a computer scientist 
who has shown his interest and expertise 
in library-related issues in research for 
the Commission on Preservation and Ac- 
cess and who has dedicated his work to 
studying problems and solutions for the 
electronic library. He has written several 
respected works on preservation and 
technology; but this book, a natural exten- 
sion of his interests and experience, is by 
far the most extensive and will reach the 
broadest audience. 

Practical Digital Libraries is one of the 
first books to treat this subject so compre- 
hensively and in such a readable style that 
students, educators, librarians, and com- 
puter scientists all will find it interesting 
and valuable. The subtitle "Books, Bytes, 
and Bucks" indicates the book's attention 
to the economics of digital library projects. 
This topic is critical in a discussion of 
changes in technology and how to mobi- 
lize, plan, and transfer library functions 
and resources to a digital environment. 

Lesk provides a thorough review of 
what a digital library is and how to build 
one; the book can easily serve as a reader 
or textbook, and it would also be useful in a 
study of trends in scholarly communica- 
tion. In the opening chapters, Lesk fo- 
cuses on the technology of conversion and 
construction and the needed equipment 
and software. He examines techniques for 
storing and manipulating images and for 
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storage of sound and multimedia, and pos- 
sibilities for classification and indexing. A 
discussion of options for delivery of the dig- 
itized content and for information retrieval 
is central in this book. Lesk concludes with 
discussions of collection, maintenance, and 
preservation of digital information and a 
review of economic models to sustain the 
development of digital libraries. The practi- 
cal and well-organized nature of this vol- 
ume, and its numerous illustrations and 
graphical presentations of complicated in- 
formation, make it a useful resource for 
anyone involved in the developing digital 
library environment. Valuable reader- 
friendly features are the extensive list of 
figures, a bibliography of references, and a 
useful index. 

Lesk tackles the "whys, hows, and what 
decisions remain to be made" (p. ixx) as 
new technology makes digital libraries 
more a reality at all economic levels. The 
author underscores how library users have 
come to expect to be able to access, search 
for, copy, and store information with maxi- 
mum ease and dependability, using com- 
mon equipment that doesn't require com- 
plex skills. The outcome of a digital library 
is a new sense of organizational culture — it 
is this social aspect of computing and the 
transformed library environment that is so 
exciting to study and experience. This 
book is a success as a planning tool, an anal- 
ysis of how to problem-solve in planning 
various aspects of a digital library. In build- 
ing a case to answer the question "Why 
digital libraries?", Lesk presents dilemmas 
that can be studied and solved in a variety 
of ways. Examples of these problem situa- 
tions include text formats; methods of 
transferring images and text so that they 
can be read and copied; new options for 
multimedia storage and access; methods of 
knowledge representation; distribution of 
content and security concerns; and usabil- 
ity and retrieval evaluation. The conclud- 
ing summary in each chapter includes both 
a summary of the chapter's content and 
ideas on future developments in this dy- 
namic environment. 

The second part of the book is less 
technical and more application-oriented, 
covering collections and preservation, 
economics, copyright and intellectual 



property, the scope of international activ- 
ities, and future impact on society and 
public policy. Lesk's almost encyclopedic 
approach gives direction to the technical 
planning process. He is candid and hon- 
est, identifying genuine problems and 
solutions that obviously have limitations 
and will need to be revisited when new 
technologies become available. 

As a librarian, I found the chapters 
"Economics," "Knowledge Representation 
Methods," and "Collections and Preserva- 
tion" most useful. The understanding of 
technicalities gleaned from the earlier 
chapters can be applied to these discussions 
of planning for digital access. The informa- 
tion is useful to planners from different 
professional avenues, for small and large 
projects, and in various higher education 
environments. There is a good introduc- 
tion to government-funded research ini- 
tiatives that nonexperts will find helpful. 

This book will have a reasonable 
shelf-life in libraries because it fills avoid. 
Until it became available, librarians had 
to study computer science and engineer- 
ing manuals to learn about the technology 
of digitization. The 1997 imprint should 
not be a concern because there is little 
else published of comparable scope in so 
intelligent and readable a volume. Many 
earlier and later works cover aspects of 
this subject in greater detail, but as a sin- 
gle-volume work on this subject, Practical 
Digital Libraries stands alone. A newer 
and somewhat related work on the same 
subject, without the technical informa- 
tion, is The Mirage of Continuity: Recon- 
figuring Academic Information Re- 
sources for the 21st Century, edited by 
Brian Hawkins and Patricia Battin (Wash- 
ington, D.C.: Council on Library and In- 
formation Resources, 1998). The World 
Wide Web is also a source of information 
about digital libraries. The D-lib Pro- 
gram, Research in Digital Libraries 
(http://www.dlib.org), provides a variety 
of useful resources, including D-Lib 
Magazine, but it is nowhere near as easy 
to use or as complete an overview as Prac- 
tical Digital Libraries: Books, Bytes, and 
Bucks. — Julia Gelfand (jgelfand© sunl. 
lib.uci.edu), Applied Sciences Librarian, 
University of California, Irvine. 
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ALCTS #1 
Business of Acquisitions 
Advancing Acquisitions: 
Services, Standards, Skills 

Thursday, June 24, 8 a.m.-5 p.m. 
Friday, June 25, 8 a.m.-noon 

Acquisitions departments and acquisitions li- 
brarians face a variety of new demands, such 
as managing electronic products, outsourcing, 
and responding to shrinking resources. This 
advanced institute examines these demands 
and challenges for continuing growth in com- 
bination with resources and abilities available 



^LCTS Members $175 
KLA Members $225 
includes ALCTS membership) 
•"Jonmembers $275 




ALCTS #2 
Bringing Monographs into the 
Digital Library 

Friday, June 25, 8 a.m.-5 p.m. 

This preconference will explore issues related 
to the incorporation of eMonographs/eBooks 
into the digital library, presenters will define 
the eBook; identify its the significance in the 
digital library; address eBook selection, acqui- 
sition, organization, distribution, copyright, 
preservation, and perpetual access from the 
librarian's perspective; and address key issues 
of importance to librarians and publishers ir 

ALCTS Members $11; 
ALA Members $16 
(includes ALCTS membershif 
Nonmembers $21.: 



For more information please go to the ALCTS website at 
http://www.ala.org/alcts/events/precoiiference99.html or contact 
Yvonne A. McLean, 1-800-545-2433, ext. 5032 or ymciean@ala.org 



3 Ways to Register 

On Line- Complete the on-line registration form located at 

http;// www. a Ja.org/events/ac99/on Ijne_reg.html 
By Fax- If you pay with a credit card you may fax your completed registration 

form 24 hours a c dialing 312-944-7841 

By Mail- Send your completed registration form with payment to: 

American Library Association, Dept #77-6565, Chicago, XL 60678-6565 

y find the registration form on the ALA website or in the 
January or April issues of American Libraries 




Build the Asian language 
collections your users want 
with the oclc asialink and 

TechPro SERVICES 



Developing Asian language collections is easy with the OCLC Asialink service. 
Staffed by Asian language experts,AsiaLink offers everything you need to establish, 
maintain and expand Chinese, Japanese, Korean and Vietnamese (CJKV) 
collections in your library. 



The AsiaLink Collection Development service lets you 
determine the specific subject matter and format of 
materials that AsiaLink staff will acquire on your behalf, 
based on your users' needs. You'll also get significant 
savings on the cost of books, including higher-than- 
average discounts from book vendors. 

AsiaLink Sets get you off to a quick start or allow you 
to easily expand existing collections. Books are avail- 
able in sets of 25, 50 or 100 pre-selected fiction and 
nonfiction titles. Both juvenile and adult sets are 
available. Title lists are updated annually, so it's easy 
to add to your collection regularly. • 

Full-level OCLC cataloging is provided for all titles and 
materials may be physically processed upon request. 



#4 

SACH 



For help cataloging titles not acquired 
through AsiaLink, turn to the OCLC 
TechPro service. TechPro provides 
expert cataloging for materials in all 
bibliographic formats and in many languages 
(including CJKV), plus optional physical 
processing. TechPro can help you quickly 
and efficiently eliminate backlogs and 
keep your cataloging up-to-date, giving 
your users prompt access to materials. 

For more information on AsiaLink or TechPro, contact Marcia Stout at OCLC. 

1-800-848-5878, ext. 4386 614-764-4386 Fax: 614-718-7277 
E-mail: marcia._stout@oclc.org. 




www.oclc.org/oclc/menu/col.htm 




